DISTURBANCES AND SILENCES IN THE PATIENT’S SPEECH
IN PSYCHOTHERAPY¹


GEORGE F. MAHL


Yale University 


THIS paper presents exploratory research
on the measurement of two expressive
properties of the patient’s speech in
psychotherapy and on their use as indices of 
anxiety. First, general considerations that 
formed the background of this work will be 
stated briefly. 

Linguistic measures of patient anxiety dur- 
ing therapeutic interviews should be useful in 
several areas of psychological research. (a) If 
such measures are correlated with physiological 
indices, certain psychosomatic relations in 
human behavior can be more readily investi- 
gated. The need to pursue such problems arose 
during the course of one clinical study (22) and 
a series of experimental studies (16, 17, 18, 
20) of anxiety and HCl secretion in dogs,
monkeys, and humans. This need was the
original, though now receding, impetus for
the present work. (b) Much can be learned
about the “psycholinguistics” of anxiety that
could be of value to students of language and 
of cognitive processes. (c) Because anxiety is a 
nodal variable in psychotherapy, anxiety 
measures should facilitate the objective in- 
vestigation of psychotherapy. For example, 
descriptions of changes during the course of 
psychotherapy or comparisons of the effect of 
various therapeutic techniques and procedures 
will be enhanced if they effectively portray 
the changes or differences in the patient’s 
anxiety. A review of the literature shows 
that only a few objective studies of therapy 
have tried to measure anxiety. 

¹ This report deals with one aspect of current research
on “The Patient’s Language as Expressive
Behavior.” Most of the work presented here was sup- 
ported by the Foundations’ Fund for Research in Psy- 
chiatry. Some of it was also supported by the USPHS, 
Grant M-1052. The writer wishes to express his appre- 
ciation to Richard Karpe, M.D., and John Higgins, 
M.D., for many important contributions to this pro- 
gram, and to Robert Abelson, Ph.D., for advice on sta- 
tistical techniques. 

A more extended discussion of this exploratory work, 
including a manual of instructions for training people 
to score speech disturbances and a circuit diagram of 
the timing apparatus, is available (19). The high-fidelity 
facilities used to obtain the tape recordings have been 
described elsewhere (21). 


The preceding interests require the measurement
of anxiety as it occurs immediately in the
therapeutic session. This process is different 
from determining characteristic anxiety reac- 
tions, defenses, etc., in the patient’s general 
behavioral repertory such as one attempts for 
diagnostic purposes. For the present goals, 
one wants to assess the current behavioral 
state of the patient. 

In the therapeutic interchange, the intensity 
of the patient’s anxiety may fluctuate mark- 
edly during the course of a single interview. 
Continuous change must also be anticipated 
in most of the variables with which one would 
wish to correlate patient anxiety, such as 
physiological measures, various categories of 
therapist behavior, patient-content categories, 
or other linguistic measures. As a result, pro- 
cedures giving only single measures of anxiety 
for large segments of interviews or for entire 
sessions taken as the units might obscure more 
precise covariations that inevitably would be 
of interest. It seems that the most generally 
useful procedure would involve the measure- 
ment of some relatively continuous aspect of 
the patient’s behavior. The data could then 
be summated or averaged for whatever tem- 
poral or behavioral units were required for 
specific problems. 

Sound recordings supply one with a con- 
tinuous stream of behavior, the language of 
the participants, from which linguistic meas- 
ures of anxiety meeting the preceding require- 
ments can potentially be derived. The basic 
working hypothesis underlying the present use 
of recordings for this purpose has been that 
the most valid linguistic measures of anxiety 
will be those based on the behavioral or “expressive”
aspects of the speech rather than
those based on manifest verbal content analy- 
sis. A full discussion of the reasoning back of 
this assumption is beyond the scope of the 
present paper, but some of it appears in the 
discussion. This working assumption, of course, 
is one that must be re-examined during the 
research in progress. 

Empirically, two of the many behavioral 
attributes of speech in the interview that are
useful to the therapist in assessing anxiety in
the patient are (a) disturbances in speech
called “jumbled,” “confused,” or “flustered”
speech, and (b) hesitancies and longer silences
by the patient when he is free and motivated 
to talk. Theoretically, silence (and perhaps 
speech disturbance) may be regarded as a 
defense motivated by anxiety evoked by 
ideational events or by the nature of the 
interpersonal relation. Speech disturbances and 
short hesitancies may also be conceived as 
predominantly indirect linguistic consequences 
of anxiety that do not have the instrumental
function of reducing anxiety. This notion is 
based on the assumption that one effect of 
anxiety, regardless of its source, is to disrupt 
all complicated ongoing behavior, irrespective 
of its behavioral relation to the source of the 
anxiety. Here, speech is merely an excellent 
instance of such complex behavior susceptible 
to the disruptive effect of concurrent anxiety. 

This exploratory research was intended to 
devise reliable methods of measuring these 
two attributes of speech from sound recordings 
of interviews, to study the general discrimina- 
tory properties of the measures, and to explore 
a procedure for validating these measures as 
indices of patient anxiety. 


THE MEASURES

Speech-Disturbance Measure 

Upon analyzing instances of obvious speech 
confusion in interviews, it is found that they 
consist of a variety of separate identifiable 
forms of disturbances occurring at a rapid rate. 
These same disturbances occur at lower rates 
at times when the patient’s speech is not 
clinically experienced as confused. A quanti- 
tative measure of this variability can be 
obtained by identifying the various disturb- 
ances in a verbatim transcript and then 
computing the following ratio for any given 
language sample: 
Speech-Disturbance Ratio = (N Speech Disturbances) / (N “Words” Spoken by Patient).

In this paper Speech-Disturbance Ratios for 
successive two-minute segments of therapeutic 
interviews and for certain content and inter- 
action phases of interviews are considered. 
In practice the speech-disturbance categories
are marked in the transcript while listening
to the recording and reading the transcript.
The “word” count used to date has included
the number of completed words, the number
of incomplete words, the number of “sounds”
caused by stuttering and incoherent sounds,
and the number of “ah’s.” The value placed
in the denominator could probably just as well
be some other measure of the amount of talking
done by the patient, such as the “number of
seconds of patient talk.” The requirement is
only to have a disturbance measure that is
independent of the verbal output by the
patient.
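
The ratio just defined can be illustrated with a minimal sketch. The container `SegmentCounts`, its field names, and the example tallies are hypothetical and are not taken from the scoring manual; only the four components of the "word" count and the ratio itself come from the text.

```python
from dataclasses import dataclass


@dataclass
class SegmentCounts:
    """Tallies for one language sample (hypothetical container)."""
    disturbances: int                  # scored speech disturbances
    completed_words: int
    incomplete_words: int
    stutter_or_incoherent_sounds: int
    ahs: int

    def word_count(self) -> int:
        # The "word" count sums the four kinds of units listed in the text.
        return (self.completed_words + self.incomplete_words
                + self.stutter_or_incoherent_sounds + self.ahs)


def speech_disturbance_ratio(seg: SegmentCounts) -> float:
    """N speech disturbances divided by N "words" spoken by the patient."""
    words = seg.word_count()
    if words == 0:
        raise ValueError("no patient speech in this sample")
    return seg.disturbances / words


# Invented tallies for a single two-minute segment.
example = SegmentCounts(disturbances=9, completed_words=190,
                        incomplete_words=3,
                        stutter_or_incoherent_sounds=4, ahs=3)
ratio = speech_disturbance_ratio(example)
```

Dividing by the "word" count is what keeps the measure comparable across segments in which the patient talks a great deal or very little.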

The speech-disturbance categories presently 
used are listed below and briefly defined. These 
were developed empirically by studying re- 
cordings and typescripts, and noting dis- 
turbances in individual words and in the 
word-word progression that were sensed as 
superfluous or distorting to the communication 
of the content. Judging from experience in 
scoring thirty-odd interviews for twelve dif- 
ferent patients and speech samples of 40 
undergraduates, this set seems to be generally 
adequate. 


Disturbance Categories 


1. “Ah.” Wherever the definite “ah” sound (as distinguished
from “er,” “um,” etc.) occurs, it is scored.

2. Sentence Correction. A correction in the form or
content of the expression while the word-word progression
occurs. To be scored, these changes must be sensed
by the listener as an interruption in the word-to-word
sequence.

3. Sentence Incompletion. An expression is interrupted,
clearly left incomplete, and the communication
proceeds without correction.

4. Repetition. The serial superfluous repetition of one
or more words, usually of one or two words.

5. Stutter.

6. Intruding Incoherent Sound. A sound which is
absolutely incoherent as a word to the listener. It
merely intrudes without itself altering the form of the
expression and cannot be clearly conceived of as a
stutter, omission, or a tongue-slip (though some may
be such in reality).

7. Tongue-slip. This category includes neologisms,
the transposition of words from their correct serial
position, and the substitution of an unintended for an
intended word.

8. Omission. Parts of words, or rarely entire words,
may be omitted. Contractions are exempted. Most
omissions are of terminal syllables of words.


Real examples of these disturbance cate- 
gories are contained in the following interview 













excerpt. In the transcript an asterisk (*) is 
placed at the point of the disturbance. To the 
left are the categories in the corresponding line 
of the excerpt. If there is more than one dis- 
turbance in a line, the category names are in 
sequence with the asterisks. Since the actual 
scoring is done only when listening as well as 
reading, some of it may not be completely 
clear from this written material alone. Poor 
grammar, inaudibility of the recording, and 
interruptions by the therapist are not grounds 
for scoring the categories. 


Illustrative Scored Excerpt 
Abbreviations 


1. In transcript: P = patient; T = therapist 


2. In left margin: SC = sentence correction 
R = repetition 
St = stutter 
O = omission 
T-S = tongue-slip 
Inc = incompletion 
IS = incoherent sound 
                   P: My impression of my relation to
                      D-(son) have always been that the
T-S, SC               reason that I don’t* *didn’t
                      seem to feel the love for him that
                      I felt for J-(daughter) was that
                      during the first sixteen months of
                      his life I was away. I didn’t grow
                      up with him. If there was any
                      jealousy of D- it was in relation to
                      his in-laws. Now that is very pos-
                      sible. Although it’s something
Ah                    which I also suppressed. Ah*.. .
                      and the reason I say it’s possible
2 St                  ’cause it sort of well le.* le.* leaves
Ah, R                 a ah* .. . a* sort of memory.
                      When I (?) I...
                   T: You’re jealous of his in-laws?
                   P: Of his in-laws, yeah. Because he
                      was brought up with them till...
R                     until* I came home. He was born
T-S, O, SC            in their hou** . *hospital and
                      came to their house, and my wife
                      lived with her parents
                   T: Ho . . . how do you mean you’re
                      jealous of his in-laws?
Ah                 P: Well...ah*... when I first came
                      home, and for the first year or so,
Ah                    or more than a year... ah*.. D-
                      was more prone to turn to his
                      grandfather and grandmother than
                      he was to me.
                   T: Mmmhnn.
                   P: And although I understood it,
                      there was a certain amount of...
Ah                    ah*. well not bitterness, I
                      wasn’t bitter about it, but a certain
SC                    amount of .. .*a sort of resent-
                      ment, a mild type of resentment.
                   T: You make this sound so. . . ah
                   P: Well
                   T: So diluted
                   P: Well it was diluted. I mean it
                      wasn’t something which I felt
                      keenly enough to be angry at his
                      grandparents, let’s say, or with D-
                      himself. I mean I realized that he
                      had grown up with them
                   T: Mmmhnn
                   P: And therefore, it was more natural
Inc                   for him to. . .* until he became...
SC, St                *completely o.* overcame his .
Ah                    ah* . . . strangeness to me, and it
Ah                    took quite a long time. Ah*. .
R                     then there was . . .* there was a
                      certain amount of resentment. It
                      wasn’t directed against the parents
IS, O (?), IS, St     or dec* *was...den*...ac.*
                      actually directed against a circum-
                      stance which kept me away. And
O                     it was a resentment which re*...
SC, Ah, SC            *in a certain. .ah*... * to a cer-
                      tain extent reflected itself also in
                      the (clears throat) feeling I had
                      toward people who had remained
                      behind and had made money.
                   T: Mmm. Your in-laws make money?
                   P: Yes. My father-in-law made a lot
                      of money from the war.


The Patient-Silence Quotient 


A Patient-Silence Quotient is the silence 
measure described here, although other means 
of assessing silence have been explored in this 
research. For any given segment of an inter- 
view this is defined as follows: 


Patient-Silence Quotient = (N Seconds of Silence) / (N Seconds Available to Patient to Talk).


Silence Quotients for successive two-minute 
segments in the interview and for certain 
content and interaction phases in the inter- 
view are considered later in the paper. 

The basic apparatus for obtaining the 
relevant activity data consists of a tape re- 
corder and a pair of hand switches each 
connected with the clutch of a Springfield 
timer. The observer’s task is to listen to the 
recording and to press one switch whenever 
and for as long as the patient talks and to 
press the other switch in comparable fashion 
for the therapist’s talking. Each clock runs 
accumulatively whenever its hand switch is 
closed. At the end of the interview segment 
involved, the total number of seconds each
participant was observed to be talking is read 
from the clocks. The values for computing the 
Silence Quotient for the segment are then 
derived as follows: 

a = N seconds in the segment
b = N seconds talk by therapist
c = N seconds talk by patient
N seconds of silence = a - (b + c)
N seconds available to patient = a - b
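
The derivation above can be written out as a short sketch. The function name and the example clock readings are illustrative assumptions; the arithmetic follows the definitions of a, b, and c given in the text.

```python
def patient_silence_quotient(segment_s: float, therapist_talk_s: float,
                             patient_talk_s: float) -> float:
    """Silence Quotient for one segment, from the two clock readings."""
    a, b, c = segment_s, therapist_talk_s, patient_talk_s
    silence = a - (b + c)       # N seconds of silence
    available = a - b           # N seconds available to the patient
    if available <= 0:
        raise ValueError("therapist talked for the whole segment")
    if silence < 0:
        raise ValueError("talk times exceed the segment length")
    return silence / available


# Invented readings: a 120-second segment, 20 s of therapist talk,
# 60 s of patient talk, hence 40 s of silence out of 100 s available.
quotient = patient_silence_quotient(120.0, 20.0, 60.0)
```

The therapist's talking time is removed from the denominator so that the quotient reflects only the time in which the patient was free to speak.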


When successive two-minute Silence Quotients 
are desired, apparatus with the following 
operating features is used. The observer 
momentarily closes a foot switch which starts
the tape recorder and a telechron motor geared for
½ r.p.m., and which closes relay switches in
series with the clutches of the accumulating
clocks. He then operates the hand switches 
as described above. A microswitch is auto- 
matically opened when the telechron motor 
completes one revolution in two minutes. This 
instantly releases the relays to stop the tape 
recorder and inactivate the clock clutches. 
The clock readings are recorded, the clocks 
are reset to zero, and this sequence repeated 
for the next two minutes, etc. By using this 
apparatus the raw data for the two-minute 
Silence Quotients of a 45-60 minute interview 
can be obtained in about one and a half hours 
by an experienced operator. 
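
A software analogue of the two-minute procedure may make the bookkeeping clearer. The sketch below is not a description of the apparatus: it assumes the talk of each participant is already available as hypothetical (start, end) intervals in seconds, and that patient and therapist never talk at the same moment. Each segment's totals play the role of clocks that are read and reset every 120 seconds.

```python
def talk_seconds_per_segment(intervals, n_segments, segment_s=120.0):
    """Accumulate talk time per segment, like a clock reset each segment."""
    totals = [0.0] * n_segments
    for start, end in intervals:
        for i in range(n_segments):
            seg_start = i * segment_s
            seg_end = seg_start + segment_s
            # Part of this talk interval that falls inside segment i.
            overlap = min(end, seg_end) - max(start, seg_start)
            if overlap > 0:
                totals[i] += overlap
    return totals


def two_minute_silence_quotients(patient_intervals, therapist_intervals,
                                 n_segments, segment_s=120.0):
    """Successive Silence Quotients, one per segment."""
    patient = talk_seconds_per_segment(patient_intervals, n_segments, segment_s)
    therapist = talk_seconds_per_segment(therapist_intervals, n_segments, segment_s)
    quotients = []
    for c, b in zip(patient, therapist):
        available = segment_s - b
        silence = segment_s - (b + c)
        # A segment filled entirely by the therapist has no defined quotient.
        quotients.append(silence / available if available > 0 else float("nan"))
    return quotients


# Invented intervals covering two successive two-minute segments.
patient_talk = [(0.0, 50.0), (100.0, 150.0)]
therapist_talk = [(60.0, 90.0)]
quotients = two_minute_silence_quotients(patient_talk, therapist_talk, n_segments=2)
```

A talk interval that straddles a segment boundary is split between the two segments, which corresponds to stopping the clocks at the end of each revolution of the timing motor.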

To obtain the Silence Quotients for the 
larger and variable time segments reported 
below, there are some minor changes in pro- 
cedure and use of the apparatus which will 
not be described here. 


Reliability of the Measures 

Three people have learned to score speech 
disturbances. The interobserver reliability has 
been determined by correlating the total N of
disturbances scored by independent judges in
unselected samples of transcript pages (N =
28-65) for several different patients (N = 3-5).
The average product-moment intercorrelation 
was .94. 
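
For readers who wish to reproduce this kind of reliability check, the product-moment correlation between two judges' page totals can be sketched as follows. The per-page counts are invented for illustration and are not the data behind the reported coefficient.

```python
from math import sqrt


def product_moment_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length, at least 2 items")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("one list has no variability")
    return sxy / sqrt(sxx * syy)


# Hypothetical disturbance totals per transcript page for two judges.
judge_a = [12, 7, 15, 9, 20, 4]
judge_b = [11, 8, 14, 9, 22, 5]
r = product_moment_r(judge_a, judge_b)
```

The same function applies to the test-retest comparison of two-minute Silence Quotients, with the first and second runs in place of the two judges.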

One observer has learned to obtain Silence 
Quotients with the apparatus and procedure 
described above. Test-retest runs at two to 
four week intervals were made for five inter- 
views. Three interviews were for one patient, 
and one each were from two other patients. 
The average product-moment correlation of 
the test-retest two-minute Silence Quotients
was .96.


GENERAL PROPERTIES OF THE MEASURES 


Thirty-odd interviews from twelve patients 
have been scored for speech disturbances and 
measured for Silence Quotients. Most of these 
interviews have come from two patients under- 
going psychotherapy. The remainder of this 
paper pertains to the material from these two 
patients. This section of the paper treats only 
the successive two-minute measures. 

Tables 1 and 2 contain descriptive statistics 
of the two-minute measures for Mrs. Y and 
Mr. Z. Speech disturbances and silence are 
prominent modes of behavior for these pa- 
tients from the purely quantitative standpoint. 
On the average, Mrs. Y produces one speech 
disturbance per 23 spoken “words” and Mr. 
Z one speech disturbance per 16 “words.” 


TABLE 1
SUMMARY OF TWO-MINUTE SPEECH-DISTURBANCE RATIOS
FOR MRS. Y AND MR. Z

              Mrs. Y                               Mr. Z
Interview   N 2′                      Interview   N 2′
No.         Ratios   Mean    SD       No.         Ratios   Mean    SD
 1          2[?]     .037    .019      2          30       .057    .021
 2          25       .041    .014     15          30       .068    .023
 5          30       .038    .015     17          31       .060    .040
10          33       .052    .018     20          29       .062    .026
13          28       .042    .015     26          32       .064    .034
17          28       .049    .026     67          30       .055    .029
20          26       .036    .013     83          30       .075    .024
26          29       .048    .024     88          30       .050    .027
Mean =               .043    .018     Mean =               .061    .028


TABLE 2
SUMMARY OF TWO-MINUTE SILENCE QUOTIENTS FOR
MRS. Y AND MR. Z

              Mrs. Y                               Mr. Z
Interview   N 2′                      Interview   N 2′
No.         Ratios   Mean    SD       No.         Ratios   Mean    SD
 5          30       .260    .25[?]    2          30       .370    .133
10          29       .291    .122      5          30       .467    .160
13          28       .383    .137     15          30       .455    .071
17          28       .445    .220     17          30       .579    .196
20          26       .34[?]  .224     20          29       .554    .099
26          29       .308    .173     26          30       .587    .137
                                      67          30       .567    .163
                                      83          30       .516    .004
                                      88          30       .570    .134
Mean =               .332    .188     Mean =               .518    .132

On the average, Mrs. Y is silent 33 per cent
of the time she could talk and Mr. Z, 52 per 
cent of the time he could talk. These are not 
atypical patients in these respects. 

Tables 1 and 2 suggest that these measures 
differentiate therapeutic interviews with respect
to mean level and variability. Figures 1
and 2 contain graphs of the successive two- 
minute Speech-Disturbance Ratios and Pa- 
tient-Silence Quotients for Mrs. Y’s 17th 
session. These graphs illustrate an impression 
one gets that the measures tend to portray 
trends or phases within many interviews, as 
well as differentiating between interviews for 
a given patient. Confirmation of these points 
would be compatible with the belief that the 
scores were behaviorally meaningful in general, 
and not just artifacts of the laboratory. 

Bartlett’s test and a simple analysis of vari- 
ance were used to test for the between-hour 
differences in variability and means. The re- 
sults presented in Table 3 show that there are 
significant between-hour differences in both 
variability and means for both measures. Since 
the variances and the sample N’s meet the 
conditions for which no appreciable error is 
made in interpreting the F test (10), it can be 
concluded that the differences in means are 
true differences and not due to the within- 
interview heterogeneity of variance. 
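
Both tests can be approximated from summary statistics of the kind printed in Table 1. The sketch below uses the usual textbook forms of Bartlett's statistic and the one-way analysis of variance F ratio. It assumes, without confirmation from the text, that the printed SDs are sample standard deviations with N - 1 in the denominator; because the tabled values are rounded, results computed this way can only be compared loosely with Table 3.

```python
from math import log


def bartlett_statistic(ns, variances):
    """Bartlett's chi-square statistic for homogeneity of k variances."""
    k = len(ns)
    total_df = sum(n - 1 for n in ns)                      # N - k
    pooled = sum((n - 1) * v for n, v in zip(ns, variances)) / total_df
    numerator = (total_df * log(pooled)
                 - sum((n - 1) * log(v) for n, v in zip(ns, variances)))
    correction = 1 + (sum(1 / (n - 1) for n in ns) - 1 / total_df) / (3 * (k - 1))
    return numerator / correction


def anova_f(ns, means, variances):
    """One-way analysis of variance F ratio from group N, mean, variance."""
    k = len(ns)
    total_n = sum(ns)
    grand_mean = sum(n * m for n, m in zip(ns, means)) / total_n
    ss_between = sum(n * (m - grand_mean) ** 2 for n, m in zip(ns, means))
    ss_within = sum((n - 1) * v for n, v in zip(ns, variances))
    return (ss_between / (k - 1)) / (ss_within / (total_n - k))


# Mr. Z's Speech-Disturbance Ratios as printed in Table 1 (rounded values).
ns_z = [30, 30, 31, 29, 32, 30, 30, 30]
means_z = [.057, .068, .060, .062, .064, .055, .075, .050]
sds_z = [.021, .023, .040, .026, .034, .029, .024, .027]
variances_z = [sd ** 2 for sd in sds_z]      # assumption: SDs use N - 1

chi_square_z = bartlett_statistic(ns_z, variances_z)   # df = k - 1 = 7
f_z = anova_f(ns_z, means_z, variances_z)              # df = 7 and N - k
```

Bartlett's statistic is referred to the chi-square distribution with k - 1 degrees of freedom, and the F ratio to the F distribution with k - 1 and N - k degrees of freedom.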

The Wald-Wolfowitz run test was used to 
evaluate the presence of phases in the two- 
minute graphs. This test is described by Moses 
(24). Briefly, in the present application, the 
median of the two-minute measures for each 
hour is determined. A “run” is defined as a 
series of one or more scores falling on the same 
side of the median. The number of runs for an 
interview is determined and the difference 
between the observed and the theoretically 
expected number is measured in standard error 
units. Under the null hypothesis, in a given 
sample of interviews, the sum of the deviate 
scores resulting from the run test would be 
zero. The standard error of the sampling distribution
of such sums would be √N. The
significance of the departure of the obtained 
sum from zero is then evaluated with the 
normal probability table. 
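
The steps of this application can be sketched as follows. The standard large-sample formulas for the expected number of runs and its variance are used; the handling of scores exactly equal to the median (they are dropped here) is an assumption, since the text does not say how such ties were treated. All scores and deviates in the example are invented.

```python
from math import sqrt
from statistics import median


def runs_deviate(scores):
    """Return (observed runs, expected runs, deviate in SE units)."""
    med = median(scores)
    # Assumption: scores equal to the median are dropped.
    above = [s > med for s in scores if s != med]
    n1 = sum(above)
    n2 = len(above) - n1
    n = n1 + n2
    if n1 == 0 or n2 == 0:
        raise ValueError("all scores fall on one side of the median")
    runs = 1 + sum(1 for a, b in zip(above, above[1:]) if a != b)
    expected = 2 * n1 * n2 / n + 1
    variance = 2 * n1 * n2 * (2 * n1 * n2 - n) / (n * n * (n - 1))
    if variance == 0:
        raise ValueError("too few scores for a run test")
    return runs, expected, (runs - expected) / sqrt(variance)


def combined_deviate(deviates):
    """Sum of per-interview deviates divided by its standard error, sqrt(N)."""
    return sum(deviates) / sqrt(len(deviates))


# Invented two-minute scores with one clear phase change: only two runs.
runs, expected, z = runs_deviate([1, 2, 3, 4, 5, 6, 7, 8])

# Invented deviates for four interviews of one patient.
combined = combined_deviate([-2.0, -1.0, 0.5, -1.5])
```

A negative deviate means fewer runs than expected by chance, that is, scores on the same side of the median tend to cluster into phases.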

The hypothesis tested here was that in each 
patient’s sample of interviews there would be 
on the whole fewer runs, for both measures, 
than if the successive two-minute measures 
were random uncorrelated scores. The hypothe- 
FIG. 1. SPEECH-DISTURBANCE RATIOS IN SUCCESSIVE
TWO-MINUTE INTERVALS OF MRS. Y’S
17TH SESSION
[Figure not reproduced. Horizontal axis: successive two-minute intervals.]


FIG. 2. PATIENT-SILENCE QUOTIENTS IN SUCCESSIVE
TWO-MINUTE INTERVALS OF MRS. Y’S
17TH SESSION
[Figure not reproduced. Horizontal axis: successive two-minute intervals; vertical axis: Patient-Silence Quotient.]


TABLE 3
TESTS OF BETWEEN-INTERVIEW DIFFERENCES IN
TWO-MINUTE MEASURES

                                      Bartlett’s Test     Analysis of Variance
Measure                    Patient    χ²        p         F        p
Speech disturbance ratio   Mrs. Y     24.39     <.001     2.7      <.02
                           Mr. Z      16.14     <.05      2.19     <.05*
Silence quotient           Mrs. Y     11.18     <.05      5.55     <.001
                           Mr. Z      37.22     <.001     9.88     <.001

* The violation of the assumption of homogeneous variance
might cause this to be spuriously “significant.” The true value
might be as high as .07 (see 10).


sis of too few runs was posed because of general 
knowledge about the interaction between the 
therapist (the writer) and these patients. He 
allows the patient to become anxious as a result 
of his association trends and the interaction, 
but he also permits the patient to resort to 
defensive maneuvers for varying periods of 
time depending on the circumstances. The 
therapist also at times consciously intervenes
to reduce the patient’s anxiety. At times he is
insensitive to the patient’s anxiety, in which
case the patient usually maneuvers successfully
as long as the therapist
follows the lead of the patient. At times the
therapist changes the topic himself because he 
too becomes anxious. Of course anxiety is not 
always reduced either by the patient or the 
therapist, so every hour would not be expected 
to show clear phasic changes in a purported 
anxiety measure. 

One of Mr. Z’s interviews, the 83rd, was 
omitted from this test. This was because over 


TABLE 4
RESULTS OF WALD-WOLFOWITZ RUN TEST APPLIED
TO SUCCESSIVE 2-MINUTE SPEECH
DISTURBANCE RATIOS

            Mrs. Y                              Mr. Z
N Obtained -        Deviate        N Obtained -        Deviate
N Expected Runs*    (SE units)     N Expected Runs*    (SE units)
    -7.5             -2.95             -3.0             -1.11
    -1.5              -.57             -4.0             -1.48
    -4.0             -1.48              -.5              -.18
     -.6              -.27             -6.0             -2.22
    -3.0             -1.2              -4.0             -1.48
    -3.0             -1.2              +3.5             +1.32
     +.5              +.2              +2.0              +.72
    +1.5              +.53
             Σ =     -6.94                      Σ =     -4.43
             p < .01                            p < .05
             (one tail)                         (one tail)

* For individual interviews.


TABLE 5
RESULTS OF WALD-WOLFOWITZ RUN TEST APPLIED
TO SUCCESSIVE 2-MINUTE SILENCE QUOTIENTS

            Mrs. Y                              Mr. Z
N Obtained -        Deviate        N Obtained -        Deviate
N Expected Runs*    (SE units)     N Expected Runs*    (SE units)
    -9.0             -3.59             -9.0             -3.33
    -5.0             -1.92             -7.0             -2.59
    -3.0             -1.11             -7.0             -2.59
    -4.5             -1.70             -1.0              -.37
    -1.0              -.38             -8.0             -2.96
     -.5              -.19             -9.0             -3.33
                                       -4.5             -1.70
                                       -7.0             -2.59
             Σ =     -8.89                      Σ =    -19.46
             p < .001                           p < .001
             (one tail)                         (one tail)

* For individual interviews.


a year before any of this work started, the
writer had evaluated it and used it for demonstration
purposes because of its illustration of
sustained conflict with many oscillations in 
anxiety level. Thus, it was already known that 
it would be inappropriate to include this 
hour in the present test of too few runs. 
The results of the Wald-Wolfowitz tests 
presented in Tables 4 and 5 confirmed the 
prediction of few runs for both measures in both 
patients. An obvious refinement of this test 
would be to predict independently which 
interviews would show more and which less 
than the number of runs expected by chance. 


A PROPOSED VALIDATION PROCEDURE 


Preliminary Findings 

The measures of speech disturbance and 
silence were selected for study because there 
were empirical and theoretical reasons for be- 
lieving they would be valid linguistic indices of 
concurrent anxiety in the speaker. The pre- 
ceding findings show that these two speech 
attributes discriminate something in the 
patient’s behavior. The questions now are: is 
it variability in anxiety that is being dis- 
criminated and how can this be determined? 

During therapeutic sessions and while
studying recordings it often appears that in-
terviews are divisible into natural segments or
phases, each of which could be assigned to a
single theme of content or interaction, and that
the patient becomes anxious and conflictful
in some, but becomes less anxious in others.
In some of the latter, it can often be observed
that the patient himself has changed the topic
or started a new line of interaction with the
therapist in such a way that his behavior can
be interpreted as a relatively sustained and
successful defensive maneuver.

During live interviews or in listening to
recordings, judgments of such changes in
anxiety as have been mentioned are based in
large part upon changes in expressive speech
attributes. The assumption underlying the
validation procedure and test to be described
is that given adequate context for interpreta-
tion, it would be possible to judge such phases
in typescripts only (not recordings). If this
were done validly, and if the Speech-Disturb-
ance Ratio and Silence Quotient are valid
indices of patient anxiety, then these two
measures should be greater in the anxious or
conflictful phases than in the low anxious or
defensive phases.

A test of these predictions has been made for 
Mrs. Y, using interviews 5, 10, 13, 17, 20, 26 as 
the test hours. Before the test was made, ad 
hoc clinical evaluations of a similar nature with 
interviews from Mr. Z and from other patients 
appeared to give promising results. In addition, 
almost the same experimental-clinical pro- 
cedure described below had been followed with 
still different interviews of Mr. Z from his first 
26 sessions. The results were essentially the 
same as for Mrs. Y, and are presented else- 
where (19). They are not described here be- 
cause the possibility of contaminated phase 
judgments was not as well controlled as in the 
test with Mrs. Y’s interviews. 

With Mrs. Y, the first major problem was to 
prevent contaminated criterion judgments. 
The writer had been the therapist, but the test 
interviews occurred slightly over two and one- 
half years before this validation test was made. 
The writer had not previously replayed the 
recordings or studied transcripts of these inter- 
views. After practicing with other interviews, 
a secretary edited verbatim transcripts and 
prepared clean scripts that did not contain any 
speech disturbances, any annotations concern- 
ing pauses, or any explicit reference to silence 
by the participants. These scripts were no 
longer exactly verbatim, of course. With these 
precautions, it was felt that no serious con- 
tamination of the phase judgments occurred. 

The judgment of the proper anxiety category 
for the phases of content or interaction requires 
intimate knowledge of the context. One of the 
main reasons for this is that the judgment can- 
not be made on the basis of manifest content 
alone. The context includes general knowledge 
of the patient and of the dynamic setting of a 
given therapeutic session. The development of 
the context is a major problem. A good deal 
was known about the patient since the judge 
had been exposed to over 100 therapeutic 
interviews with her. A more immediate context 
of the test hours was obtained by listening to 
the recordings for the first 29 interviews, 
excepting the test hours. Notes made of each of 
the nontest hours and the complete edited 
typescripts were then reviewed in sequence. 
Before the final judging of phases in any given 
test hour, a rough clinical description of the 
therapeutic situation at the time was written. 

After the interviews were divided into phases 


TABLE 6
SPEECH-DISTURBANCE RATIOS AND SILENCE QUOTIENTS
IN PHASES OF INTERVIEWS. MRS. Y

                                          Phase
                                Low Anxious    Anxious or
Measure              Statistic  or Defensive   Conflictful
Speech-disturbance   N          15             15
ratio                Mean       .038           .049          t = 2.68
                                                             p < .01
                     SD         .013           .009
Silence quotient     N          15             15
                     Mean       .316           .427          t = 3.51
                                                             p < .001
                     σ²         .0032          .0122         F = 3.94
                                                             p < .01
                     SD         .054           .107


and categorized as to anxiety type, the phases 
were marked off in the original verbatim 
typescripts. A second person then determined 
the Silence Quotients and the Speech-Disturb- 
ance Ratios for the phases. Just as the phase 
judge had no prior knowledge of the objective 
measures, the scorer of the objective measures 
had no knowledge as to the identity or meaning 
of the phases. 

A total of nineteen defensive and/or low 
anxiety and nineteen anxious or conflictful 
phases were judged in the five test hours. For 
any given interview, however, there had not 
always been judged an equal number of each 
type of phase. To control for the confounding 
of “hour effects” demonstrated earlier, equal 
numbers of the phase types were randomly 
selected for each interview. This procedure 
produced a sample of fifteen phases in each 
group. Table 6 contains the pertinent statistical 
data. Both objective measures were signifi- 
cantly higher in the anxious or conflictful 
phases. The mean Speech-Disturbance Ratio 
increases by 29 per cent, while the mean Silence 
Quotient increases by 35 per cent in the anxious 
over the other phases. It will be noted that the 
Silence Quotients were also significantly more 
variable in the anxious phases. Nevertheless, 
the p value of the mean difference is such that 
one can safely conclude that there is a signifi- 
cant difference in means as well as in vari- 
ability. 
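
The percentage increases quoted above, and the variance ratio for the Silence Quotients, can be checked from the phase means and SDs in Table 6. The sketch below uses only those printed, rounded values, so the variance ratio it yields agrees with the tabled F only approximately.

```python
def percent_increase(low: float, high: float) -> float:
    """Increase of `high` over `low`, as a percentage of `low`."""
    return 100.0 * (high - low) / low


# Phase means from Table 6 (low anxious or defensive vs. anxious or conflictful).
sdr_increase = percent_increase(0.038, 0.049)   # Speech-Disturbance Ratio
sq_increase = percent_increase(0.316, 0.427)    # Silence Quotient

# Ratio of the Silence Quotient variances, computed from the printed SDs.
# With equal N in both groups, any N / (N - 1) correction cancels out.
variance_ratio = (0.107 ** 2) / (0.054 ** 2)
```

Rounded to whole numbers, the two increases come to 29 and 35 per cent, and the variance ratio is about 3.93, close to the tabled F of 3.94.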


Illustrative Interview 


The material for Mrs. Y’s 13th interview illustrates 
the kind of interpretive phase judgments involved and 
the way in which the linguistic measures vary with the 
phases. Before proceeding to the interview itself, some
preliminary information concerning the patient is pre-
sented.

Mrs. Y, a college graduate, was in her late thirties
at the onset of treatment. She had been married for
eleven years. She had two daughters, aged 7½ and 9.
A third pregnancy had terminated with a miscarriage
some seven months earlier. The patient sought treat-
ment ostensibly for relief from recurring duodenal
ulcers and because of a general feeling of unhappiness
and dissatisfaction in her relations with her husband
and children. This was consciously related to frustration
of strong dependency needs based on a lifelong history
of repeated rejection by both of her parents as well as
other significant people. As therapy progressed,
a strong sex-fear conflict with a great deal of Oedipal
flavoring seemed equally, if not more, important in
determining her present troubles. This conflict was
not only specific to genital behavior but was general-
ized to many modes of feminine behavior. For example,
even though she moved in a fashionable circle and was
not restricted in income, she was afraid to buy and
wear the kinds of attractive clothing worn by her
peers. She always chose “practical” and “adequate”
clothes. While she was aware of “missing the boat” in
her sexual relations with her husband, the sexual con-
flict and the general fearful avoidance of femininity and
womanhood was almost completely unconscious in the
early stage of therapy.

At the time of the 13th session the patient had been
in treatment for a month. During this period she had
manifested and felt a positive attachment to the
therapist. She was conscious of the dependency aspect
of this attachment but not of an inferred unconscious
sexual aspect. Consciously she had felt frightened at
the dependent attachment, being afraid that the
therapist would reject her as her father had done ir-
revocably and traumatically when she was ten, and as
she alleged a previous therapist had done.

There were some unusual aspects of her treatment
arrangement that are also pertinent to the evaluation
of the 13th session. She knew the therapist was doing
research on both recorded therapy interviews and on
the psychophysiology of peptic ulcer patients. She
also knew that her therapy was included in these studies
and that the therapist had accepted her for treatment
partially for these reasons. She was aware of the
investment of time and energy in each case and the
importance of each patient for the research program.
In fact, she had felt this as an attraction to enter into
treatment with this particular therapist.

Just before the 13th session the patient had been
informed by her obstetrician that she was pregnant.
The interview is centered on this topic. Becoming pregnant
raised many psychological problems for her, but
two aspects are particularly prominent in this particular
interview. First, she tells the therapist that she is
definitely pregnant and, second, she and the therapist
explore an irrational fear she has of telling her mother
that she is pregnant.

Prior to judging phases in the hour, it was known
that both of these aspects were anxiety-laden matters
for her. She had suspected some two weeks earlier that
she might be pregnant and at that time awoke during
the nights before her 8th and 9th sessions preoccupied
in part with how the therapist would react to this news.
She anticipated that if she were pregnant her therapy 
would be interrupted and that this would let the 
therapist down in his research program. Her thoughts 
at 3:30 a.m. on the day of the 9th session were: 

“Should I tell Dr. Mahl?” She anticipated that he
would think “That’s just like a woman. There goes my
research program. She is gumming up the works.” She
also said to herself, “Your research program depends in
part on me.”

Also in the 9th interview she stated that she was 
afraid to tell her mother she might be pregnant. After 
the 13th interview her mother came to visit for several 
days as had been previously arranged. The day her
mother came, the patient became “ill” and had to go to
bed. Only after this did she tell her mother she was preg- 
nant. (The mother reacted positively.) These observa- 
tions show that the irrational fear of telling her mother 
which is explored in the 13th interview was real and 
intense. 

The 13th session. This interview is now summarized 
by phases. For each phase, the content or interaction is 
first summarized, then the category into which the 
phase was placed and the reasons at the time of the 
judgment for doing so are stated. The Speech-Disturbance
Ratios and Silence Quotients for the phases are
presented in Fig. 3.


Phase 1 


After spending 30 seconds in idle chitchat about the 
weather, Mrs. Y abruptly comments: “I got off to a 
bum start this morning. I’m mad! At least I got mad 
at my family before breakfast even.” She proceeds to 
state that she woke up with a hollow feeling in her
stomach and ate a cracker. She casually interposes the
news that since last time she has discovered she is
definitely pregnant because she “... had an A.Z. test,
whatever that is.” She immediately goes on to describe
in detail her “bum start.” Her two daughters had pro- 
voked her, one by getting toothpaste on a clean dress 
and the other by putting on a clean dress instead of one 
she had worn before. The patient had become irritated 
and her breakfast is “sitting right there in a lump.” 
She closes this account by saying that these are more 
unnecessary things she has to cope with. 

Interpretation of the phase. It is inferred that the 
patient is afraid of the therapist’s reaction to the defi- 
nite knowledge that she is pregnant. The nature of this 
fear is the same as she described earlier in the 9th 
session. This anticipatory fear motivates her to inform 
the therapist of the A.Z. test results as casually as 
possible and to induce a sympathetic attitude in him 
by portraying herself as the “victim” of her daughters. 
The content of the phase is instrumental in doing these 
things. By being receptive to the account, the therapist 
“aids” the patient in doing these things. For these 
reasons, this is judged to be a relatively successful 
defensive maneuver in which the anxiety of the patient 
is minimal. Both linguistic measures were at their lowest
level in this phase.


Phase 2 


The patient spontaneously changes the subject,
saying straightforwardly now that she had the A.Z.
test and it was positive. Then she says, “Now I’m
trying to figure out how I’m going to get through having
my mother with me for a week without telling her I’m
pregnant. ’Cause I don’t want to tell her.” She elaborates
very briefly, saying she knows her mother will be
upset and will think the patient shouldn’t have any
more children, especially so soon after the miscarriage.

[Fig. 3. Speech-Disturbance Ratios and Patient-Silence Quotients in the
Successive Phases of Mrs. Y’s 13th Session. Two panels, “Speech Disturbance
Levels” and “Silence Levels,” each plotted against the duration of phases in
minutes; phases are marked as either “conflictful-anxious” or “defensive
maneuver, low anxiety.”]

The therapist returns to her news of being pregnant
by asking the patient how she felt when she heard the
A.Z. was positive. She replies that it struck her as
amusing because she conceived immediately after she
and her husband decided to try again. She adds that
she’ll be glad to have another child. The therapist asks
how this jibes with earlier statements by her to the
contrary. With many contradictions, she says that she
is glad because her husband wants another child, but
that it’s not her favorite idea and she wouldn’t do it if
she didn’t feel she should. The patient comments
briefly on how she is resigning from several community
activities in order to keep her activities at a minimum
and prevent another miscarriage.

The therapist asks then for further elaboration of her
fear of mother’s reaction. The patient says again that
she is afraid to tell her mother and was always
afraid in previous pregnancies. The therapist remarks,
“[Str]iking, isn’t it.” The patient acknowledges this and
states that while she did, and does, feel afraid,
her mother has always expressed
surprise when the patient tells her of this fear. Even so,
the patient continues, she knows her mother really does
disapprove and that she will be critical of her. But then
she expresses the belief that there must be something
more to this feeling [...] has said. The therapist
agrees [...] doesn’t know what the
basis for this feeling could be and would like to know.
The therapist states that by talking about her relationship
with her mother now she might start to find
out.

Interpretation of the phase. This is regarded as a
phase in which the patient starts to talk about the
anxiety-evoking news that she is definitely pregnant,
material she was avoiding in part, and preparing
for in part, in the first phase. Three things
happen that increase her anxiety: (a) she tells the
therapist directly that she is pregnant, (b) under some
pressure from the therapist, her surface “happiness” at
being pregnant is rendered ineffectual as a defense
against the anxiety and guilt elicited by her negative
feelings, and (c) the defensive aspect of blaming her
mother for the anticipatory fear at telling her of being
pregnant is partially exposed.

As can be seen in Fig. 3, both the speech-disturbance
and silence measures increase during this phase.


Phase 3 
Responding to this remark of the therapist made at
the end of Phase 2, the patient now introduces the
theme that her mother has never seemed maternal or
supportive, especially when something happens that
increases demands upon her. The patient elaborates this
with rather detailed descriptions of two instances in
which her mother had responded to illness in the
family by seemingly becoming upset, excited, impatient,
and anxious to escape the situations as soon as
possible.

Interpretation of the phase. This is interpreted as a
rather extended defensive maneuver consisting of a
well-organized rationalization of her fear of the mother’s
reaction. Since this is a reinstitution of a defense that
had just previously been partially exposed, the anxiety
provoked in Phase 2 should now decrease. (The fact
that this reinstitution of the defense was necessary here
contributed to the judgment of Phase 2 as conflictful or
anxious.) It will be seen in Fig. 3 that the behavioral
linguistic measures both decrease during this phase
from their level in Phase 2.


Phase 4 


The therapist asks how her mother had reacted to
the patient’s previous pregnancies in such a way as to
account for her present fear. The patient describes her
mother as having been interested in the pregnancies,
having brought gifts, etc. She repeats how her mother
had expressed surprise at the patient’s fear of her reaction.
The therapist comments that this suggests there
hasn’t been any basis in mother’s behavior for her to
justify her fear and that maybe the source of the fear
lies within herself. The patient asks if the therapist
means a feeling between herself and her mother or
between herself and the rest of the world. After the
therapist replies that he doesn’t know the answer to this
and that it must come from her, Mrs. Y says, “I’d like
to know. I really would like to figure out what it is. I
mean whether I’m transferring to my mother some
feeling that I have, whether it’s a feeling of guilt about
having them or what, I don’t know.” She elaborates this
by saying the guilt might be because she doesn’t think
she is a good mother to her children. The therapist asks
if this is an attempt to intellectualize and says this
particular feeling could not have been present when she
was afraid to tell her mother the first time she became
pregnant. Mrs. Y agrees and wonders why she should
feel guilty about being pregnant. Shortly she reports
feeling blocked in thinking further about it. (It had been
previously determined that reports of blocking by the
patient are not themselves reliable clues of silence.
Therefore such ambiguous references were left in the
typescript.)

Interpretation of the phase. This is a conflictful or
anxious phase. The patient should be highly conflictful
because the therapist is continually confronting her
with the inadequacy of her previous accounts of why
she fears her mother’s reaction. The patient momentarily
gives up the defensive rationalizing, is left blocked
in this mode of defensive behavior, and feels repressed
near the end of the phase.

It is seen in Fig. 3 that the Silence Quotient rises
markedly in this phase. Observation of the recording
and the unedited typescript after the phase judgment
had been made showed that most of the silence occurred
at the end of the phase. Thus, it instrumented
the repression or suppression the patient alluded to. The
Speech-Disturbance Ratio also rises in this phase.


Phase 5 

When the patient says she feels blocked at the end 
of Phase 4, the therapist states that this may often 
happen in her therapy and that while he could offer his 
thoughts about her fear of her mother, it really wouldn’t 
help. Mrs. Y replies that she realizes that he probably 
doesn’t want to plant ideas in her mind and that 
“I’ve gotta dig for it.” She goes on to say that this fear
of telling her mother is funny, because she doesn’t 
mind telling other people about being pregnant. She 
feels guilty about telling her mother and she shouldn’t
have to feel this way. But this is true of other things
too, for she hasn’t told her mother very much about
herself for years.

Interpretation of the phase. This was judged to be
another highly conflictful phase. The therapist was
pressing the patient to continue without her defensive
maneuvers, and the patient made open acknowledgments
that the source of the fear was within herself. In
so doing, she was more directly giving up her prepotent
defenses.

Both measures rose in this phase, although the rise
over Phase 4 had not been specifically predicted. That
they should do so makes sense ad hoc because the therapist
was actively prodding the patient when she was
already anxious and conflictful.


Phase 6 


The patient states that previous attempts to confide
in her mother were “flops.” When the therapist
asks what happened, Mrs. Y responds by recalling in
some detail an incident in her teens when she was in
boarding school. She had told her mother that she and
a boy were corresponding with each other. Shortly
afterwards the boy unaccountably stopped writing and
the patient was sure her mother had interfered, maybe
because she did not approve of men. With collaboration
from the therapist the patient describes other instances
of her mother interfering with her dating. In general she
depicts her mother as having disapproved of all such
adolescent heterosexual activities.

Interpretation of the phase. This was judged to be a
defensive phase in which the patient’s anxiety should be
decreased from the previous anxiety phase. The content
was regarded as a displaced reference to the basic unconscious
fear of giving expression to her feminine
sexual desires and needs.

In Fig. 3, the Silence Quotient decreases in this
phase, but the speech disturbance does not decrease.
This is commented upon in the discussion.


DISCUSSION 


References to silence, blocking, and con-
fused speech by the patient are numerous in
clinical records and reports. But this is not the 
case in the scientific literature on objective re- 
search with recorded interviews. 

To the writer’s knowledge there has been no
systematic analysis of the disturbed speech 
phenomena considered in this paper in thera- 
peutic interviews. Some investigators have 
measured verbal activity or word output in 
interviews of various types and in relation to 
numerous variables (1, 2, 3, 4, 5, 6, 8, 11, 12, 
13, 14, 15, 26, 27, 30). Of these, only Ruesch 
and Prestwood (30) deal directly with anxiety 
in interviews. Discrete silences per se were 
made the explicit object of study by Tindall 
and Robinson (34). They conducted a descriptive
study of silences in nondirective interviews,
covering such factors as who initiated
the silences, their apparent meaning, and their 
effect on the progress of the interview. Gil- 
lespie (9) included the frequency of discrete 
silences in a resistance measure. Silence was 
measured implicitly, of course, in most of the
verbal activity studies. But the activity meas- 
ures have usually been such that they are not 
simply the obverse of the degree of silence. 

Chapple (2) devised a method for obtaining 
activity measures in interviews that required 
an observer to operate hand switches when the 
participants spoke. These switches activated 
magnetic markers that recorded on polygraph 
paper. A variety of temporal measures could 
then be made from such recordings, including 
instances and durations of simultaneous silence 
by both participants. Verzeano and Finesinger 
(35) later developed a speech analyzer that 
eliminated the need of the observer under 
certain conditions and that yielded auto- 
matically much of the data available with 
Chapple’s method. The method and apparatus 
used here to obtain the Silence Quotient are 
merely further refinements on Chapple’s idea 
for a special purpose. 

Thus, there is general awareness that silence 
and disturbances are significant aspects of the 
patient’s speech. Yet, as the summary review 
above shows, there has been relatively little 
investigation of these attributes in general and 
almost none of their specific use as indices of 
patient anxiety. The work described in this 
paper shows that the Speech-Disturbance 
Ratio and the Silence Quotient are reliable and 
discriminating measures, and suggests that 
there is cause for further studies of their 
validity as anxiety indices.²

There were several reasons for using motivational
phase judgments as validation criteria.
Physiological measures usually seem “safer,”
but they involve their own methodological
problems, often serious ones, and the psychophysiology
of emotions is not so well established
as to provide one with ready-made measures
guaranteed to be generally applicable
indices of various emotions. Clinical ratings of
interviews are also candidates for validation
criteria. Even when shown to be reliable, they
threaten validation studies because they
cover large units of time within which there
may be considerable behavioral variation. The
motivational phases are behaviorally as meaningful
as physiological measures; they focus on
smaller units of behavior than ratings of interviews;
and they prevent errors arising from
being “content bound.”

² There is also an unexpected side-product of the
analysis of confusion in speech into the discrete disturbance
categories. Individuals appear to differ in the
degree to which they use the various categories and
there are some lawful relations between the categories.
Some of these findings were presented at the Eastern
Psychological Association meeting, 1955, in a paper
“The use of ‘ah’ in spontaneous speech” and are being
prepared for publication now.

In the specific case of linguistic measures of 
anxiety, the defensive maneuver and con- 
flictful phases seem especially useful as valida- 
tion criteria. They are significant and frequent 
occurrences in psychotherapy. Sullivan, for 
example, presents a very useful discussion of 
them (33). This study might be regarded as an 
exploration, suggesting a procedure for estab- 
lishing adequate context for their judgment and 
for preventing contamination. A more adequate 
design, of course, would include the determina- 
tion of the reliability of the phase judgments. 
It was a calculated risk to proceed here with 
this gap in design, for negative findings would 
have been ambiguous concerning the anxiety- 
measuring property of the Speech-Disturbance 
Ratio and Silence Quotients. 

In the introduction it was stated that the 
basic working assumption of this research was 
that the most valid linguistic measures of 
anxiety will be those based on “expressive” 
aspects of speech rather than on manifest 
content analysis. The basis for this assumption 
must be briefly considered. First, however, 
what is meant here by “expressive” aspects of
speech may be clarified. In a speech sample 
there are verbal (words) and vocal elements. 
The vocal elements consist of conventional vocal 
symbols and vocal clues or signs of emotional 
change that may be perceived by a therapist 
but were not intended as symbols by the 
patient. The former are illustrated by cultur- 
ally determined intonation patterns that serve 
a function analogous to punctuation marks in 
writing. The latter are illustrated by the speech 
disturbance categories described here. They 
might be compared to variations in writing 
pressure or letter formation that are unknown
to or unintended by the writer but could be
detected by a measuring technique. The term 
“expressive” aspects of speech is used here to 
refer to only this second group of vocal 
attributes. 

Manifest content analysis of interviews 
yields quantitative descriptions of what the 
patient says. As generally practiced, however, 
small content units in written material are 
categorized in an uninterpretive way in the 
sense that the instrumental function of the 
content and the expressive properties accom-
panying the content at the time it is uttered are
ignored. (Probably the most extreme instance
of such work is that in which individual state-
ments of patients have been typed on separate
cards and these written and isolated utterances
used to measure affective states.) If one uses
such manifest content scores obtained by un- 
interpretive means as indices of emotional 
states, he assumes that there is more often 
than not some sort of uniform (“between” and
“within” patients) and usually direct reflection 
of an emotional state in the words uttered. A 
simple example would be measuring currently 
present fear in the patient by counting the 
number of sentences he utters containing fear 
words. At times this may be a valid procedure. 
But there are also many instances in which a 
person will not utter fear sentences when he is 
frightened and may utter them when he is not 
frightened. Illustrative occasions in therapy are 
those in which a patient, frightened by a 
therapist, may verbally attack him in content, 
though in an uncertain voice, and those in 
which a very dependent patient feels calm and 
protected and sounds comfortable, while en- 
grossing the therapist with accounts of fearful 
events. 
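
The uninterpretive counting procedure just described may be made concrete with a brief sketch. It is offered only as an illustration under stated assumptions: the list of fear words, the sentence-splitting rule, and the function names are hypothetical and are not taken from any study cited here. The sketch scores a passage by the number of sentences containing at least one fear word, and so it ignores, as the procedure itself does, both the motivational context and the manner in which the words are uttered.

```python
import re

# Hypothetical fear-word list, chosen only for illustration.
FEAR_WORDS = {"afraid", "fear", "fears", "frightened", "scared"}


def split_sentences(text):
    """Split text into sentences at ., ! or ? followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def fear_sentence_count(text):
    """Count the sentences that contain at least one fear word."""
    count = 0
    for sentence in split_sentences(text):
        words = set(re.findall(r"[a-z']+", sentence.lower()))
        if words & FEAR_WORDS:
            count += 1
    return count


# Invented sample passage, not a quotation from any interview.
sample = "I am afraid to tell her. She brought gifts. I fear she will be critical."
print(fear_sentence_count(sample))
```

A score of this kind is the same whether the sentences are spoken calmly or in an uncertain voice, which is the limitation at issue in the text.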

If the disparity between the manifest con- 
tent, on the one hand, and the motivational 
context and the expressive aspects, on the 
other hand, were relatively infrequent in psy- 
chotherapy, one could proceed safely to use 
content analysis in measuring emotions. But 
this type of disparity seems too frequent to dis- 
regard. It is certainly great enough in both 
everyday life and in psychotherapy to have 
received special consideration. Sapir (31) was 
referring in part to this distinction in his con- 
trast between language as an expressive as well 
as a symbolic-referential system. George H.
Mead (23) made this distinction in his lucid
contrast of a “conversation of gestures” with a
“conversation of significant symbols.” In the 
clinical literature, Reich’s (29) classical distinc- 
tion between “what” and “how” a patient 
speaks, and Sullivan’s (33) contrast between 
“vocal” and “verbal” behavior both call at- 
tention to the same problem. 

Probably all aspects of speech are subject to 
individual differences in learning and emo- 
tional usage. Thus, such expressive measures as 
Speech-Disturbance Ratios and Silence Quo- 
tients may be subject to the interpretive 
approach just as manifest content is. This 
seems most likely with silence, which at times 
may be a result of anxiety but at other times 
may be a form of stubbornness or hostility, of 
thoughtfulness, seductiveness, etc. To the 
extent that this is so, the routine use of speech 
disturbances and silence measures as indices of 
anxiety is subject to the same limitations as 
content measures. This research work is simply 
based on the starting belief that (a) the expres- 
sive measures are less influenced by the learning 
process in socialization and are thus less vari- 
able in their meaning and less subject to the 
errors of the uninterpretive approach than 
manifest content, and (b) inferences about the
behavioral state of the patient based on uni- 
form treatment of them will be right signifi- 
cantly more often than wrong. Only further 
work will determine if this belief is correct. 

In the meantime it seems likely at least that 
the Speech-Disturbance Ratio and the Silence 
Quotient could be profitably used in conjunc- 
tion with simultaneous manifest content meas- 
ures like the Distress-Relief Quotient of Dol- 
lard and Mowrer (7), the “feeling” scores of the 
nondirective therapists (e.g., 28, 32), and the
more refined content measures of Murray (25). 
These expressive measures might reduce the 
error in inferring behavioral states from the 
verbal content even though they do not turn 
out to be superior or adequate by themselves 
in this respect. This potentiality may be illus- 
trated with Phases 3 and 6 in Mrs. Y’s 13th 
session. In Phase 3 the patient accounts for her 
fear of telling her mother she is pregnant by 
referring directly to her mother’s failure to 
gratify people’s real needs for support and
indirectly to her own fear of dependency deprivation.
In Phase 6, the patient refers to her
mother’s disapproval of her dating boys in her
adolescence. This was regarded as a displaced 
reference to the patient’s fear of reproval for 
the feminine sexual manifestations involved in 
pregnancy. On the basis of content alone, it 
would be very difficult to state in which phase 
this patient was more successfully defending 
against anxiety-evoking thoughts. The expres- 
sive measures suggest that she was decidedly 
more anxious in Phase 6 than in Phase 3 in 
that the displacement in Phase 6 was a less 
successful defense than the rationalization of 
Phase 3. In the course of her therapy sexual 
fears and conflicts have been the source of 
much greater resistance than have dependency 
fears and conflicts. 

The effect on these measures of other vari- 
ables than motivational states must still be 
assessed. This seems particularly true with 
respect to the speech-disturbance measure and 
the effect of verbal intelligence, education, 
social class, and sex. One gets the impression 
that these variables do not seriously limit the 
value of the Speech-Disturbance Ratios in 
making comparisons “within” patients, but 
systematic evidence is lacking. Also it is likely 
that a way to combine the two measures into a 
single one would be useful. Since silence seems 
best regarded as a defense, the Silence Quotient 
may increase due to mounting anxiety and yet 
reduce the Speech-Disturbance Ratio. Figure 3 
suggests that this type of interaction may have 
occurred in the conflictful phases of Mrs. Y’s 
13th session. This raises a difficult problem of 
weighting the two measures. Scores obtained 
by simple z-score combinations have not 
seemed as useful as the individual measures 
taken separately in some preliminary attempts 
along these lines. 
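
A simple z-score combination of the kind mentioned above might be sketched as follows. This is only a minimal illustration, assuming equal weights and standardization across the phases of a single session; the paper does not specify how the preliminary combinations were computed. The function names are hypothetical, and the six values per measure are invented for the example. They are not the values plotted in Fig. 3.

```python
from statistics import mean, pstdev


def z_scores(values):
    """Standardize values to mean 0 and population standard deviation 1."""
    m = mean(values)
    sd = pstdev(values)
    if sd == 0:
        # No variation across phases: every standardized score is zero.
        return [0.0 for _ in values]
    return [(v - m) / sd for v in values]


def combined_index(disturbance, silence):
    """Sum the two standardized measures phase by phase (equal weights assumed)."""
    if len(disturbance) != len(silence):
        raise ValueError("both measures must cover the same phases")
    return [d + s for d, s in zip(z_scores(disturbance), z_scores(silence))]


# Hypothetical values for six phases, NOT the data of Fig. 3.
disturbance_ratio = [0.02, 0.05, 0.03, 0.05, 0.06, 0.06]
silence_quotient = [0.05, 0.15, 0.08, 0.30, 0.35, 0.10]

combined = combined_index(disturbance_ratio, silence_quotient)
for phase, score in enumerate(combined, start=1):
    print(f"Phase {phase}: {score:+.2f}")
```

Equal weighting treats a rise in silence and a rise in speech disturbance as interchangeable. If silence acts as a defense that lowers the disturbance ratio, the two standardized scores can partly cancel, which is one way of seeing the weighting problem raised above.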


SUMMARY 

Linguistic measures of the patient’s anxiety 
in therapeutic interviews can be useful in 
several areas of research. A working assumption 
is that the most valid measures will be based on 
the expressive aspects of speech rather than on 
manifest content measures. On empirical and
theoretical grounds, speech disturbance and 
silence seem to be expressive attributes that 
are useful as anxiety indices. Reliable methods 
for measuring these two attributes have been 
described. Data from two cases show that the
two measures discriminate “something” between-sessions
and within-sessions for a given
patient. Motivational phase judgments have 
been proposed as useful criteria for validating 
the Speech-Disturbance Ratio and Silence 
Quotient as anxiety measures. Detailed knowl- 
edge of the dynamic context of the interview is 
necessary to make such judgments of criteria 
since manifest content is of limited value in 
this regard. A procedure for acquiring the con- 
text without contaminating these judgments 
has been described. Preliminary results of the 
proposed validation procedure indicate that 
the present approach is fruitful. The research 
and ideas presented here are exploratory and 
can only be regarded as providing leads that 
might be of interest to others working in this 
area so costly of research time.


ONE DETERMINANT OF THE CAPACITY TO FREE-ASSOCIATE
IN PSYCHOTHERAPY¹


MAURICE K. TEMERLIN


University of Oklahoma

When at the outset of analytically-oriented
psychotherapy a client is
asked to verbalize everything that
comes into his mind, without organizing his
thoughts and no matter how trivial, embarrassing,
illogical, painful, or confused his thoughts
may seem, he then is forced to respond to a
situation which for him is almost totally without
structure. From his point of view, the instructions
to free-associate create an unstructured
situation because he is asked to say
everything that comes into his awareness but he
is not told what to say specifically or how to
say it. Moreover, he is not told in what sequence
he “should” bring forth the content of
his awareness, or what feeling, gesture, tone of
voice, or emotion “should” be correlated with
his utterances. Finally, he frequently does not
know what his responses mean or how the
therapist is interpreting them, and often he
can perceive no relationship between attempting
to free-associate and receiving the help for
which he originally asked.

Within this framework it is axiomatic that 
individuals differ greatly in their capacity to 
approximate free association at the outset of 
treatment, with few persons actually being 
able to associate freely. Too, the instructions to 
free-associate usually produce a wide variety 
of defensive reactions on the part of the client. 
If there are idiosyncratic, internally consistent 
aspects of personality which are revealed under 
conditions of reduced external structure (as in 
the case of projective tests, for example), then 
individual differences in the capacity to free- 
associate at the outset of treatment should be 
correlated with individual differences in be- 
havior in other unstructured situations. 

Since the autokinetic situation as it has been
used in recent studies (1, 2, 4) tends to be as
free of external standards and criteria as the
therapeutic situation described above, the variability
of behavior in the early stages of psychotherapy
should be correlated with the variability
of successive judgments of movement in
the autokinetic situation. Specifically, it was
hypothesized that clients whose behavior when
asked to free-associate is highly variable, flexible,
or productive exhibit significantly greater
variability when judging the extent of autokinetic
movement on successive trials than Ss
whose behavior at the outset of psychotherapy
is highly inflexible, stereotyped, or rigid.

¹ This study was one part of a doctoral dissertation
in the Department of Psychology at the University of
Oklahoma. Grateful acknowledgement is made for help
given by Dr. William B. Lemmon and for advice and
apparatus given by Dr. Muzafer Sherif. Mr. Bernard
Moskowitz provided statistical consultation. Dr. Alfred
F. Glixman and Mr. Robert S. Wilson read the manuscript
and made many valuable criticisms and suggestions.


METHOD 

Subjects 

Seventeen Ss were selected from a popula- 
tion of 180 neurotic college students who had 
requested intensive, outpatient, analytically- 
oriented, personal psychotherapy from the 
Guidance Service of the University of Okla- 
homa between 1951 and 1953. Three categories 
descriptive of behavioral variability, flexibil- 
ity, or productivity during the first 20 hours 
of psychotherapy were defined as follows: 


Category 1. Extremely Flexible and Productive Behavior.
When asked to free-associate, the client throughout
the first 20 hours consistently behaves in ways that
the therapist considers extremely variable, flexible, or
productive. Examples: The client assumes personal responsibility
for the conduct of his therapy and works
hard to free-associate; he produces material which is
meaningful; he interacts emotionally with the therapist
and “lives out” emotional experiences during the hour;
he demonstrates a willingness to come to terms with his
resistances; he gives interpretations a fair hearing,
neither rejecting them blindly and dogmatically nor accepting
them passively and compliantly.

Category 2. Extremely Rigid and Unproductive Behavior.
When asked to free-associate, the client throughout
the first 20 hours consistently behaves in ways that
the therapist considers extremely inflexible, stereotyped,
or rigid. Examples: The client consistently “blocks” and
is unable to express himself; he seems almost totally
unwilling or unable to express the content of his awareness;
he passively and objectively describes his experience
in an affectless, uninvolved, or abstract manner; he
assumes no responsibility for the conduct of the
therapeutic hour; he demonstrates an inability or an
unwillingness to come to terms with his resistances; he does
not interact with interpretations.

Category 3. Miscellaneous Behavior. Behavior which
does not clearly and consistently fall within either of the
above categories.

The author, who served as Therapeutic Practicum
Supervisor at the Guidance Service,
classified each of the 180 members of the popu- 
lation within one of these categories. His judg- 
ments were based upon previously obtained 
records of staff conferences, supervisory or con- 
sultant contacts, or practicum course presen- 
tations. The therapist of each S was then asked 
to classify his client (or clients) within one of 
the three categories on the basis of his personal 
observations of the client’s first 20 hours of 
therapy. Both therapist and experimenter had 
to agree on each person’s classification; and if 
either therapist or experimenter had any doubt 
or disagreement whatsoever regarding the 
client’s classification, he was placed in Cate- 
gory 3 and not used as a subject. 

This procedure yielded 7 Ss in Group 1 and 
43 Ss in Group 2. Of the total population of 
180, only the 7 persons classified in Group 1 
were judged able to approximate relatively 
free association during the first 20 hours of psychotherapy.
From the 43 members of Group 2, 10 Ss were
then selected to match the 7 Ss of Group 1 as closely
as possible in terms of intelligence, sex,
socioeconomic status, and the number of hours
of psychotherapy experienced at the time the
autokinetic data were collected. The two
groups then had these characteristics:


Group 1. There were 5 males and 2 females in this 
group. Ages ranged from 18-30 with a median of 22.0 
years. Their IQ’s ranged from 117 to 132 with a median 
of 124. The number of hours of psychotherapy each S 
had experienced at the time the autokinetic data were 
collected ranged from 21 to 46 with a median of 29.

Group 2. There were 7 males and 3 females in this 
group. Ages ranged from 18.5-32 with a median of 23.6 
years. Their IQ’s ranged from 118 to 130 with a median 
of 125.6. The number of hours of psychotherapy ranged 
from 22 to 67 with a median of 33.2 All Ss in both groups 
were of upper-middle socioeconomic status. 


Apparatus


The apparatus was the same as that used by
Sherif (4). In a dark and soundproof room a
pin point of light was exposed through a circular
hole one millimeter in diameter. The appearance
of the light, the exposure time and
the interval between successive exposures were
controlled automatically by a series of relay
circuits and were constant for all Ss on each
trial.

2 This study was conceived and executed within an
analytically-oriented framework where the therapeutic
process is thought to take from 300 to 500 hours. In
these terms, the data of this study were collected at the
outset of treatment, and it was felt that none of the Ss
had shown any significant personality changes as a
function of psychotherapy.


Procedure 
The S was told: 


We are trying to perfect a test which will predict how 
well a person is going to do in psychotherapy. However, 
your only task will be to tell me how far a point of light 
moves. I am going to take you into a dark room and 
show you a point of light. Your job will be to judge the 
distance the light moves, making your judgments as 
accurate as possible. I want your eyes to get accustomed 
to the dark, so I want to blindfold you here and lead 
you into the dark room. 


The statement that the experiment was an at- 
tempt to predict therapeutic success was an 
effort to produce the kind of ego-involved un- 
certainty frequently experienced by persons 
beginning psychotherapy. 

The S was then blindfolded and led by the 
hand over a long (75 yards) and complex route 
and seated in the experimental room. This pro- 
cedure was designed to produce greater spatial 
disorganization and thus maximize individual 
differences. It has been demonstrated that in- 
dividual differences in the variability of judg- 
ments of autokinesis increase with experimen- 
tally-induced uncertainty (4). 

In the darkened room the blindfold was 
removed, and these instructions were given: 


Periodically, I will show you a point of light. After a few
seconds the light will start to move. As soon as you see
the light move, press this button. A few seconds later
the light will go out. As soon as the light goes out, tell 
me in inches the total distance the light moved, making 
your judgments as accurate as possible. 


Fifty judgments were taken from each S in 
this manner: Three seconds after the S had 
signified that he had perceived movement, the 
light was turned off automatically. The S then 
voiced his judgment and the light automati- 
cally reappeared 60 seconds later. 


Treatment of the Data 


The average deviation from the median 
judgment was determined for each distribu- 
tion of 50 judgments and this score was used 
as the measure of each S’s judgmental varia- 
bility. The significance of the difference be- 
tween the autokinetic variability scores of the 
two groups was determined with the U statistic 
of Mann and Whitney (3).


RESULTS AND DISCUSSION


The variability scores for the 7 Ss of Group 1 
ranged from 2.18 to 6.08 with a median of 4.74. 
The variability scores for the 10 Ss of Group 2 
ranged from .37 to 2.44 with a median of .89. 
Eight of the 10 Ss in Group 2 achieved a lower 
variability score than that obtained by any S 
in Group 1. 

The Mann-Whitney U test was calculated 
for m = 10 and n = 7 as suggested by Moses
(3). A U value of 1.15 was obtained which is 
4.92 standard deviations from the mean of the 
U distribution. This value is significant well 
beyond the .01 level of confidence. 
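
The scoring and test described under Treatment of the Data can be illustrated with a minimal sketch. It assumes that the variability score is the mean absolute deviation of an S's judgments from their median and that U is obtained by counting, over all pairs, how often a score in one group falls below a score in the other, with the usual large-sample normal deviate. The function names and the example values are hypothetical and are not the data of this study.

```python
from math import sqrt
from statistics import median


def average_deviation_from_median(judgments):
    """Mean absolute deviation of one S's judgments from their median."""
    med = median(judgments)
    return sum(abs(x - med) for x in judgments) / len(judgments)


def mann_whitney_u(group_a, group_b):
    """Count pairs in which a score from group_a is below one from group_b.

    Ties are counted as one half, a common convention assumed here.
    """
    u = 0.0
    for a in group_a:
        for b in group_b:
            if a < b:
                u += 1.0
            elif a == b:
                u += 0.5
    return u


def u_normal_deviate(u, m, n):
    """Normal deviate of U for group sizes m and n (no tie correction)."""
    mean_u = m * n / 2
    sd_u = sqrt(m * n * (m + n + 1) / 12)
    return (u - mean_u) / sd_u


# Hypothetical judgments in inches for one S.
score = average_deviation_from_median([1, 2, 3, 4, 10])

# Hypothetical variability scores for two small groups.
flexible = [5.0, 6.0, 7.0]
rigid = [1.0, 2.0, 3.0]
u = mann_whitney_u(flexible, rigid)
z = u_normal_deviate(u, len(flexible), len(rigid))
```

With complete separation of the two groups, as in the hypothetical values, U for the higher-scoring group is zero and the deviate takes its most extreme negative value for those group sizes.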

The highly significant difference between 
the autokinetic variability scores of the two 
groups lends strong support to the hypothesis 
of a relationship between the variability of be- 
havior in these two unstructured situations. 
This appears consistent with the theory of 
projective tests that there are internally con- 
sistent, idiosyncratic aspects of personality 
which are revealed under conditions of reduced 
or ambiguous external structure. 

These results are considered empirical evi- 
dence that those (few) people who possess the 
ability to render relatively free associations at 
the outset of analytically-oriented psychotherapy
appear to have a greater capacity to
experience visual apparent movement in the 
unstructured autokinetic situation than people 
whose stream of consciousness flows less freely 
under the same conditions. It seems probable, 
in the light of the projective hypothesis, that 
these results may be interpreted as a function 
of the tolerance an individual has for experi- 
encing himself. When a neurotic who has re- 
quested psychotherapy is asked to free-asso- 
ciate, he is forced by the unstructuredness of 
the situation to respond essentially to an actual 
or anticipated experiencing of himself. If he
follows the instructions to free-associate, his
own attitudes, motives, memories, perceptions,
thoughts, and feelings, of which the self is an
organization, will sooner or later constitute
the content of his awareness and be verbalized.
The interpretation in terms of tolerance for
experiencing the self is thus consistent with the
common clinical observation that the freer an 
individual is simply to relax and to “be him- 
self,” the more variable and spontaneous is his 
behavior. On the other hand, individuals with 
less capacity to “be themselves” appear to 
exert greater control over their inner experi- 
ence with a resultant decrease in the variability 
or spontaneity of their behavior. 

Two qualifications require explicit state- 
ment. The rigorous method of selection limited 
the sample to a small number of highly select 
Ss whose representativeness may not be suffi- 
ciently broad to permit wide generalization. 
Second, it is important that these results not
be interpreted in terms of therapeutic out- 
comes, for the problem of prognosis was con- 
sidered peripheral to this study. Furthermore, 
Ss within both groups eventually deviated 
from the pattern they had demonstrated con- 
sistently during the first 20 hours of psycho- 
therapy. 


SUMMARY


Those Ss who were unusually variable, flexi- 
ble, and productive when first asked to free- 
associate in psychotherapy were compared 
with a matched group of extremely rigid, in- 
flexible, and stereotyped Ss with respect to the 
variability of their perceptions of autokinesis. 
Significant differences were found. It was con- 
cluded that the greater an individual’s toler- 
ance for experiencing himself, the more 
variable and spontaneous is his behavior in 
unstructured situations. 


REFERENCES 


1. Kelman, H. C. Effects of success and failure on
suggestibility in the autokinetic situation. J. abnorm.
soc. Psychol., 1950, 45, 267-285.
2. Klein, G. S., & Schlesinger, H. J. Perceptual attitudes
toward instability: I. Prediction of apparent
movement experiences from Rorschach responses.
J. Pers., 1951, 19, 289-302.
3. Moses, L. E. Non-parametric statistics for psychological
research. Psychol. Bull., 1952, 49, 122-143.
4. Sherif, M., & Harvey, O. J. A study in ego
functioning: Elimination of stable anchorages in
individual and group situations. Sociometry, 1952,
15, Nos. 3-4.


Received July 5, 1955 





A TEST FOR SPECIFIC AND GENERAL BEHAVIORAL EFFECTS OF
INFANTILE STIMULATION WITH SHOCK IN THE MOUSE¹


WALTER C. STANLEY 


Brown University 


PROBABLY the fundamental assumption of
personality theory in general and psycho- 
analytic theory specifically is that the 
helplessness of infancy and childhood breeds 
personality disorder. For example, White states: 


It takes high-powered anxiety to make a neurosis. 
Acute combat stress or similar traumatic situations can 
do it in adults. Otherwise only the panics and appre- 
hensions that occur in the helpless years of childhood 
are sufficiently high-powered (8, p. 258). 


A statement such as the foregoing implies a 
causal relationship between helplessness and 
severe anxiety. And, in such a context, we may 
interpret helplessness in at least two ways: (a) 
as objective helplessness, e.g., the organism is 
repeatedly exposed to “painful” or “frighten- 
ing” stimulus situations, but the termination 
of such aversive stimuli (i.e., reinforcement) is 
causally² unrelated to any specific response; or
(b) as a “sense” or “feeling” of helplessness, e.g.,
the organism’s recognition of its objective 
helplessness. 

It should be quite obvious that this second 
meaning of helplessness and its presumed rela- 
tionship to severe anxiety cannot be directly 
tested in infrahuman animal subjects, simply 
because such subjects cannot tell us whether
they recognize their helplessness. However, the
first meaning of helplessness and its assumed 
relationship to anxiety can be tested with such 
animal subjects. Accordingly, the present exper- 
iment was designed to test the hypothesis that 
exposure of the infant mouse to arbitrary shock 
(i.e., shock which the organism could not termi- 
nate by some specific response) would lead to 
greater fearfulness later on than exposure to 
shock whose offset was made contingent on a 
specific response.³ The measure of emotionality
or fearfulness utilized was the open-field test
(1, 2). As adults, the mice were also given avoidance
training trials to assess the theoretically
more transferable or generalizable effects of the
infantile treatments.

1 About two-thirds of this experiment was carried
out at Brown University by the second author as an
Honors Thesis under the direction of the first author
who prepared this report.

2 To simplify presentation of the problem we omit
consideration of the obviously interesting case where
fortuitous reinforcement may generate “superstitious”
behavior.

3 Cf. the method of Mowrer and Viek (4).


METHOD 

Subjects 

The Ss were offspring of genetically homogeneous 
mice (dba strain, subline 1) obtained from the Roscoe 
B. Jackson Laboratory in Bar Harbor, Maine. Eighty- 
four Ss who received differential infantile treatment 
were available for later tests. Of these, 36 were tested 
only in the open field, and 48 were tested both in the 
open field and in avoidance learning. In addition, data 
on performance during infantile stimulation with 
shock were available on nine Ss who did not survive 
for the later tests. 


Apparatus


Infantile shock apparatus. The apparatus used for 
infantile stimulation consisted of a wooden box, 9¾
in. long, 4 in. wide, and 244 in. deep. The shock grid
was 5 in. long and was made of g¢-in. bars spaced
¼ in. apart. To minimize S’s being trapped in either
corner of the shock compartment, a U-shaped piece 
of Lucite formed the actual walls of this compartment, 
thus eliminating its two corners. A sliding door, lined 
with Lucite, was used to bar entrance to the safe side. 
Shock to the grid was 105 v., with a fixed resistance of 
500,000 ohms in series with the grid. 

Open field test. This apparatus was like that used 
by Lindzey (5). It consisted of a circular table, 39 in. 
in diameter, which was covered with white oil cloth. 
It was illuminated by a 150-watt bulb suspended 36 
in. above its center. 

Adult avoidance-training apparatus. This apparatus 
consisted of a wooden box whose inside dimensions 
were 30 × 4¾ × 6 in. deep. The entire apparatus
was covered with hinged Lucite lids. The grid com- 
partment was 20 in. long; the safe compartment 10 
in. long. A guillotine door separated the two com- 
partments. The floor sections of both compartments 
were hinged to provide, along with appropriate holding 
relay circuits, for onset of shock 1.5 sec. after S was 
placed on the grid and for the activation of a Spring- 
field timer which yielded a measure of S’s running 
time to the safe compartment.


Procedure 


The over-all procedure is summarized in Table 1. 
Further details of the design and procedure follow: 

Housing. Throughout their lives, Ss lived in con- 
ventional two-compartment wooden mouse boxes. A 
constant artificial light-dark cycle of 14/10 hr. was 
maintained through automatic control of an overhead
fluorescent fixture. Room temperature approximated
67°, varying from 62° to 72°, and relative humidity 
varied from 21 to 40%. 

Infantile stimulation. The Ss within litters were 
randomly assigned to the differential infantile treat- 
ments with sex balanced insofar as possible. The three 
types of treatment were: (a) exposure to shock whose 
offset was contingent on S’s moving to the safe end 
of the apparatus (response-contingent group); (b) exposure
to shock equal in duration on each specific
trial to the amount of shock received by the matched 
litter mate in the response-contingent group (arbitrary 
shock group); and (c) exposure to the shock apparatus 
with matched trial durations but with shock turned 
off (the no-shock group). 
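
The matching of trial durations across the three treatments can be sketched as follows. This is only an illustration of the yoking logic stated above, assuming that each trial of the response-contingent S yields one duration in seconds that is then imposed on its litter mates; the function name, the dictionary keys, and the example times are hypothetical.

```python
def yoke_trial_durations(response_contingent_times):
    """Build per-trial schedules for one litter-matched trio.

    response_contingent_times: seconds of shock on each trial for the
    response-contingent S, whose own movement ended the shock.
    """
    schedule = []
    for trial, seconds in enumerate(response_contingent_times, start=1):
        schedule.append({
            "trial": trial,
            # Shock ended when S reached the safe end.
            "response_contingent": {"shock_on": True, "seconds": seconds},
            # Same duration of shock, ended regardless of S's behavior.
            "arbitrary_shock": {"shock_on": True, "seconds": seconds},
            # Same time in the apparatus, shock turned off.
            "no_shock": {"shock_on": False, "seconds": seconds},
        })
    return schedule


# Hypothetical response times for one daily session of five trials.
session = yoke_trial_durations([8.5, 9.0, 7.5, 10.0, 8.0])
```

Because the durations for the other two Ss are fixed by the response-contingent S, that S must complete each session first, which is the running order described in the next paragraph.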

The Ss received five trials per day for four days. 
Intertrial duration was 1 min. Thus, each response- 
contingent S completed its daily session before its 
mates in the arbitrary and the no-shock groups were 
run. However, all Ss in a litter were removed from
and returned to the nest at the same time to equalize 
durations of absences from the nest. Timing was done 
with a stop watch. 

Open field test. All Ss tested received a 2-min. expo- 
sure once a day for 10 consecutive days. Approximately 
two-thirds of the Ss were lifted by hand from their 
living cages and placed on the table top; the remaining 
one-third were released through specially built doors 
in the living cages and thus were not handled by E 
except after each 2-min. test period. Testing order of 
Ss within litters was balanced from day to day. Defe- 
cations were recorded by E. 

Adult avoidance training. The Ss were run in trios 
of litter mates consisting of a response-contingent, an 
arbitrary shock, and a no-shock mouse. The interval 
between placement of S in the start box and the 
onset of shock was 1.5 sec. The duration of intertrial 
intervals was about 6 min., with all Ss in a trio com- 
pleting Trial 1 before any received Trial 2, etc. The 
S in a trio run first was balanced among the three 
groups. Each S was run to a criterion of five successive 
avoidances in a single training session. The E also
recorded response time, the time required for S to 
reach the safe compartment, to the nearest .1 sec. 

It is important to note that the shock apparatus 
and procedure used with adult mice were different 
from those used with infant mice in the response- 
contingent group. There were two reasons for this, 
one practical and one theoretical. The practical reason 
was that the infant apparatus and training were 
adapted to a small, nonseeing (6), nonjumping, crawl- 
ing, and essentially hairless animal, while the adult 
apparatus and training were adapted to a larger, 
seeing, jumping, swiftly darting, and hair-covered 
animal. The theoretical reason was that our major 
hypothesis required escape conditioning operations in 
infancy, but our check on the effects of infantile escape 
conditioning, we assumed, would be most sensitive if 
avoidance conditioning operations were used. Solomon 
and Wynne (7) have shown in dogs that a phase of 
escape learning precedes avoidance performance. Ac- 
cordingly, it was felt that any effect of infantile escape 
conditioning operations would be most likely to be 
demonstrable by giving adult Ss avoidance training 
directly in a simple alleyway apparatus. 


RESULTS 


The infantile training data for the response- 
contingent group were analyzed to determine 
whether any improvement in performance had 
occurred. None had. For example, the median 
daily response times, based on each S’s median 
response time for its five trials, were 8.5, 8.8, 
10.0, and 9.8 sec. for an N of 37. Perhaps the 
clearest indication of chance changes in per- 
formance came from a comparison of each S’s 
response time on its very first trial with its 
response time on its last trial. If we include the
two Ss who received only 15 infantile training
trials, the N for this comparison is 39. Nineteen
Ss performed better and 20 Ss performed more 
poorly. 
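
The first-trial versus last-trial comparison can be checked against chance with a two-sided sign test. The report does not say that such a test was computed, so the sketch below is an illustrative assumption; the function name is hypothetical, and the counts are those given above (19 of 39 Ss improved).

```python
from math import comb


def sign_test_two_sided(successes, n):
    """Two-sided binomial probability for a split at least this uneven.

    Assumes each S is equally likely to improve or to worsen by chance.
    """
    k = min(successes, n - successes)
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


p = sign_test_two_sided(19, 39)
```

A split of 19 to 20 is the most even division possible with 39 Ss, so the probability returned is 1.0, in keeping with the statement that the changes were of chance magnitude.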

The open-field data are summarized in Table 
2. It is evident from the table and statistical 
evaluations that the three types of infantile ex- 
perience had no differential effect on excretory 
behavior in the open field, other than what may 
be attributed to chance. 





TABLE 1
OUTLINE OF PROCEDURE

Age in Days        Treatment
[illegible]*       Toes clipped for identification
8-11†              Infantile exposure to shock apparatus
30                 Weaning
45-54‡             Open-field test of emotionality
59-133, M = 90     One day of avoidance training to criterion

* Day 8 for 6 Ss.
† Days 9-11 for 6 Ss.
‡ Days 42-51 for 6 Ss; Days 35-44 for 3 Ss.








TABLE 2
FIELD TEST

                               Group
              Response-
              Contingent     Arbitrary
Total N       Shock Offset   Shock Offset   No-Shock   [illegible]§
14 trios*     3.1            3.5            3.3        .82, 1.24, .95
24 trios†     4.5            3.7            3.6        .84, .91, .84
30 pairs‡     4.1            3.8            —          .74

* Later tested for avoidance learning (see Table 3).
† Inclusive of the 14 trios.
‡ Inclusive of 24 pairs from the 24 trios.
§ Respectively, for the response-contingent vs. arbitrary shock,
the response-contingent vs. no-shock, and the arbitrary shock vs.
no-shock comparisons involving litter-matched Ss.


TABLE 3
PERFORMANCE IN ADULT AVOIDANCE TRAINING

                                                        Group
                                     Response-Contingent   Arbitrary
                                     Shock Offset*         Shock Offset          No-Shock*
Individual Measure       Total N     Mdn.   Q1 and Q3      Mdn.   Q1 and Q3      Mdn.   Q1 and Q3
Trials to criterion      16 trios    10.0   [illegible]    10.5   7.0, 11.5      10.5   9.5, 12.5
Median response time     14 trios†   1.6    0.9, 1.9       1.8    1.2, 2.0       1.8    1.0, 2.2
  (sec.) in 1st 9 trials

† Original data for two Ss in the no-shock group were lost.


The adult avoidance-training data appear in 
Table 3. The table presents two measures: (a) 
trials to achieve an avoidance criterion of five 
successive avoidances, including the criterion 
trials; and (b) median response time for the first
nine trials since nine trials were the least any 
S received. According to both measures of per- 
formance, the response-contingent group was 
superior to both other groups. However, only 
the difference in running times between the two 
theoretically extreme groups, the response-con- 
tingent and the no-shock, achieved statistical 
significance according to Wilcoxon’s (9) non- 
parametric test for grouped data. The grouped 
data rather than the paired replicates test was 
used because the adult avoidance training was 
carried out at various ages. The source of varia- 
tion due to this variable plus that correlated 
with litters was eliminated in the statistical 
test by treating the data as replications of com- 
parisons involving four homogeneous Ss. The 
quartiles which appear in Table 3 are included 
to indicate the rather small amount of vari- 
ability present in each group, and thus the 
simplicity of the adult avoidance learning task. 
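
The two measures of avoidance performance can be made concrete with a short sketch. It assumes that each trial is recorded as an avoidance or a failure to avoid, together with a response time in seconds; the function names and the example records are hypothetical.

```python
from statistics import median


def trials_to_criterion(avoided, run_length=5):
    """Trial on which the run of successive avoidances is completed.

    The criterion trials themselves are included in the count.
    Returns None if the criterion is never reached.
    """
    streak = 0
    for trial_number, was_avoidance in enumerate(avoided, start=1):
        streak = streak + 1 if was_avoidance else 0
        if streak == run_length:
            return trial_number
    return None


def median_response_time(times, first_n=9):
    """Median response time over the first nine trials."""
    return median(times[:first_n])


# Hypothetical record for one S: four early trials, then five avoidances.
avoided = [False, False, True, False, True, True, True, True, True]
times = [2.4, 2.1, 1.2, 1.9, 1.3, 1.1, 1.0, 1.2, 0.9]

n_trials = trials_to_criterion(avoided)
mdn_time = median_response_time(times)
```

Restricting the response-time measure to the first nine trials gives every S the same number of observations, since nine was the smallest number of trials any S received.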


DISCUSSION 


The open-field data clearly provide no support 
for the hypothesis that infantile training with 
escapable shock leads to less fearfulness than 
infantile exposure to nonescapable shock. It 
might be argued that this negative finding fol- 
lows from the fact that our infantile training 
procedure failed to demonstrate any learning 
to escape shock as measured by the performance 
of the infantile mice. A better test of the hypothesis
would have been provided if our response-contingent
infantile group had actually improved
in performance during their infantile
training.

* Difference between these two groups (Table 3) has a one-tailed p value of .02 for the response-time measure.

On the other hand, it should be recognized 
that our response-contingent shock offset pro- 
cedure did assure that each time actual offset of 
shock occurred, the infantile S’s last response to 
shock on each trial was locomotory movement. 
In this sense, the response-contingent group 
may be contrasted with the arbitrary shock 
group where such a consistent relationship between
shock, locomotion, and shock offset was
not assured, and even more clearly contrasted
with the no-shock group. Thus, our finding
that in adult avoidance training the more sensi- 
tive measure, running time, showed the re- 
sponse-contingent group reliably superior to 
the no-shock group is most suggestive of further 
research. 

It could indicate: (a) that the specific S-R 
relationships between shock and locomotory 
movements were strengthened during the infantile
training; (b) that the infantile training
provided both the response-contingent and the 
arbitrary shock groups with a learned fear drive 
(3) which was not evinced in faster running 
during adulthood by the arbitrary shock group 
because for this group during infancy shock 
terminated on some trials while Ss were not 
moving, thus theoretically strengthening re- 
sponses incompatible with locomotory move- 
ments; or (c) that the response-contingent group 
was generally most fearful of the three groups. 
To argue for either a or b would require the
further assumption that the infantile training 
procedure was too insensitive to provide a good 
measure of learning since the response times of 
the two shock groups did not differ during the 
infantile training. To argue for c would require 
the assumption that excretory responding was
too insensitive as a measure of fearfulness. In
this connection, it is worth mentioning that E’s 
impression was that the response-contingent 
Ss were the easiest to pick out of the “safe” 
compartment of the adult training apparatus. 

However, if we take the statistically signifi- 
cant finding at face value and also take into 
account the fact that our open-field data pro- 
vide no evidence for differential general fearful- 
ness among our three groups, then our positive 
finding indicates a remarkable specificity in 
the effects of infantile exposure to shock on 
later behavior. The response-contingent group 
was favored by both stimulus specificity (i.e., 
shock was used in infantile and adult training) 
and response specificity (i.e., the infantile 
training correlated shock termination with 
locomotory movements and the adult training 
also required locomotory movements). The 
data clearly suggest that it may prove heuristic 
to conceptualize the relationship between in- 
fantile experience and adult performance in 
terms of such specific transfer of training. 

The present findings are also relevant to the 
“critical period” hypothesis proposed by Scott 
and Marston (5). The hypothesis divides 
mammalian behavior into four periods, which 
are regarded as providing new and marked 
opportunities for relatively permanent behav- 
ioral integrations based on conditioning, so- 
cialization, and maturation of the nervous 
system. Scott and Marston suggest that con- 
ditioning should be difficult, or unstable if 
achieved at all, during the first or neonatal 
period. This period lasts until the mouse is 
about 12 days old. Thus, the superiority of the 
response-contingent group as compared with 
the no-shock group in the adult avoidance test 
is not in accord with the postulated instability 
of conditioning during the neonatal period. 


SUMMARY 


Mice, 8 to 11 days of age, were subjected to 
three types of infantile treatment: (a) stimula- 
tion with shock which S could terminate by 
crawling to the safe side of the apparatus (the 
response-contingent group); (b) stimulation
with shock which was merely matched in duration
to that received by Ss in the response-contingent
group, with escape from the shock
compartment barred (the arbitrary shock 
group); and (c) exposure to the shock appara- 
tus for matched durations of time without 
shock being present (the no-shock group). An 
open-field test of emotionality conducted when 
Ss were 45 to 54 days old showed no significant 
differences in the excretory behavior of the 
three groups. An adult avoidance learning test 
conducted when Ss had a mean age of 90 days 
showed that the running times of the response- 
contingent group were slightly but statistically 
significantly better than those of the no-shock 
group. The present experiment provides no 
evidence for any general emotional or trauma- 
tizing effect of either type of infantile shock 
stimulation, but does indicate the presence of 
a specific behavioral effect which is most 
readily understandable in terms of the corre- 
lation between shock, locomotory movement, 
and shock termination in the infantile training 
of the response-contingent group.


PERFORMANCE OF FUNCTIONAL PSYCHOTICS ON A
REPETITIVE TASK¹
P. H. VENABLES AND J. TIZARD


Medical Research Council, Unit for Research in Occupational Adaptation, Institute of Psychiatry, 
London, England 


AN experiment by Kraepelin (3) involving
a task of simple addition for
periods of ten minutes demonstrated
that the performance of schizophrenics is 
characterized by a rapid deterioration. When 
a pause is introduced after five minutes, per- 
formance increases markedly on the sixth 
minute, but then sinks to a low level during 
the remaining four minutes. Hoch (1) re- 
ported that manic-depressive patients show 
reversals of fatigue effects on the ergograph. 
Performance was found to increase in strength 
before a reduction appeared with exhaustion. 
In view of the somewhat unsatisfactory re- 
porting of these experiments and the presence, 
particularly in the addition task, of interfering 
factors such as pauses at the end of a line of 
figures, it was decided to make an attempt to 
repeat, on an objectively scorable task, the 
work of these earlier writers. 

Pilot experiments suggested that addition 
and color naming involving verbal reports were 
unsatisfactory because of practice effects on 
the addition test and pauses due to breathing 
in the color-naming test. The investigation to 
be reported was made using a five-choice task 
where scoring was entirely objective; no inter- 
ruptions were made to disturb S’s output, and 
the task, involving less cerebration than does 
addition, was within the ability of Ss showing 
a marked degree of psychosis. 


METHOD 


Apparatus


The apparatus was designed to provide a means 
whereby response to a light stimulus is instrumental 
in setting up a subsequent stimulus, this process being 
capable of indefinite repetition. The S’s panel consists 
of five lights and five keys arranged in pairs sym- 
metrically around the circumference of two concentric 
semicircles. The inner circle on which the keys are 
placed is seven inches in diameter, the outer with the 
lights is ten inches in diameter. At the center of the 
semicircles is a sixth key. In operation, S presses the
key corresponding to a light which is lit and then the
center key, whereupon the light changes. The sequence
is continued by pressing the key next to the
light which is then illuminated, and so on.

1 This work was carried out in Springfield Hospital,
London, by kind permission of the Medical Superintendent,
Dr. H. C. Beccle. Advice on the statistical
analysis was given by Mr. A. S. C. Ehrenberg.

The control of the display is achieved by a uni- 
selector and relay mechanism modified from that 
designed by Leonard (4). 

An event recorder recorded separately both correct 
responses and errors. A time marker permitted taking 
a count of responses in any period. 


Subjects 


Thirty male schizophrenics and 10 men suffering 
from endogenous depression were used as Ss. They 
were selected after a scrutiny of hospital records and 
an inquiry to charge nurses as to whether they thought 
the patients would be willing to cooperate. Six other 
patients selected in this way did not complete the 
tasks. No S was over 55 years of age. Diagnoses had 
been made by the medical superintendent for official 
records. 

The schizophrenics were divided into three groups 
by age (20-29, 30-39, 40-55 years), and two groups 
by length of stay. Short-stay patients had been in 
the hospital less than one year from their most recent 
admission; long-stay patients had been in the hospital 
for more than two years since their last admission. 
There were thus six groups of schizophrenics, with five 
Ss in each group. All were diagnosed as suffering from 
other than paranoid types of schizophrenia (simple, 
catatonic, or unclassified) without organic or other 
complicating features. Paranoid schizophrenics were 
excluded because a pilot study had shown that their 
response patterns differ from those of other schizo- 
phrenics. All except one of the short-stay patients 
had had more than one admission to hospital. No S 
was receiving physical treatment or sedatives at the 
time of the investigation.

The 10 endogenous depressives had a mean age 
of 49.4 with an SD of 3.6 years. In age they were com- 
parable with the 40-55-year-old schizophrenic group, 
who had a mean age of 47.3 and an SD of 2.9 years. 
Seven Ss had had more than one admission to hospital 
with a depressive diagnosis; three involutional depres- 
sives were in hospital for the first time. 


Procedure 


The Ss were told that the test was “to see how 
quick they were” and that any questions they asked 
would be answered at the end of the experiment. 
Each S was given a 10-minute run on the task followed 
by a rest pause of one minute and a further 10-minute 
run. During the rest pause, S was engaged in
conversation.

A continuous record of responses was made for each 
S. To simplify the analysis, the scores used for further 
calculation were the average score per minute of successive
two-minute periods. Linear regression constants,
calculated for each S, summarized his scores
and indicated his initial level and trend of performance
in each 10-minute work period.
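
The scoring step just described can be sketched as follows. This is only an illustration under stated assumptions: the one-minute counts are hypothetical, the function names are ours, and the "initial level" is taken here to be the fitted value at the first two-minute period, which the text does not specify.

```python
from statistics import mean


def two_minute_averages(responses_per_minute):
    """Average score per minute for each successive two-minute period."""
    if len(responses_per_minute) % 2:
        raise ValueError("expected an even number of one-minute counts")
    return [
        (responses_per_minute[i] + responses_per_minute[i + 1]) / 2
        for i in range(0, len(responses_per_minute), 2)
    ]


def regression_constants(scores):
    """Least-squares intercept and slope of scores on period index 0, 1, 2, ...

    Assumption: the intercept stands for the initial level and the slope
    for the trend of performance within one work period.
    """
    xs = list(range(len(scores)))
    x_bar, y_bar = mean(xs), mean(scores)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, scores))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope


# Hypothetical correct responses in each minute of one 10-minute run.
counts = [20, 22, 23, 25, 24, 26, 27, 27, 28, 30]
scores = two_minute_averages(counts)
level, trend = regression_constants(scores)
```

Reducing each run to two constants in this way is what allows level and slope to be analyzed separately in the Results.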


RESULTS 


The mean scores per minute for each of the 
schizophrenic groups and for the depressives 
are shown graphically in Fig. 1. Application of
analysis of variance to the regression constants
obtained from the schizophrenic groups
gave the following results.

There were no significant differences among 
the schizophrenic groups in slope (trend of 
performance). All showed a tendency to rise 
during the first 10 minutes and to fall during 
the second ten minutes. 

The differences in level (speed of performance)
between the three age groups were not
significant. There was, however, a difference
in level (significant at the .01 level in Period 1
and at the .05 level in Period 2) between
short-stay and long-stay schizophrenic Ss, the
short-stay Ss being superior to the long-stay Ss.
Tables 1 and 2 show the analyses of variance
of initial level of performance of the schizophrenic
Ss in Periods 1 and 2.


TABLE 1
ANALYSIS OF VARIANCE OF INITIAL LEVEL OF
PERFORMANCE OF SCHIZOPHRENIC GROUPS
IN PERIOD 1

Source

Age
Length of Stay
Interaction
Within Cells
Total

* Significant at the .01 level.


TABLE 2
ANALYSIS OF VARIANCE OF INITIAL LEVEL OF
PERFORMANCE OF SCHIZOPHRENIC GROUPS
IN PERIOD 2

Source            df    Mean Square    F

Age                      30.1          0.36
Length of Stay          444.67         5.31*
Interaction              12.54         0.15
Within Cells
Total

* Significant at the .05 level.

An increase in speed of performance after
rest was shown by 26 of the 30 Ss, whose
scores improved at the beginning of Period
2. By the sign test, this increase is significant
at the .01 level.
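
The sign-test result can be checked with an exact binomial calculation. The sketch below assumes that the remaining 4 Ss all showed decreases (no ties) and that the test is two-sided; neither detail is stated in the text, and the function name is illustrative.

```python
from math import comb


def sign_test_p(successes, n):
    """Two-sided exact sign-test p value under H0: P(increase) = 0.5."""
    k = max(successes, n - successes)
    # Probability of a split at least as extreme as k in one direction.
    tail = sum(comb(n, i) for i in range(k, n + 1)) / 2 ** n
    return min(1.0, 2 * tail)


p = sign_test_p(26, 30)
```

Under these assumptions the p value falls well below .01, in line with the significance level reported.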

When a comparison was made between the 
depressives and the schizophrenics who re- 
sembled them most, the 10 Ss aged 40 and 
over, it was found that the performance of the 
two groups differed significantly in slope, the 
difference being significant at the .05 level for 
both Periods 1 and 2 (t = 2.33 in each case).
The depressives rose steadily in performance 
throughout both Period 1 and Period 2, 
whereas the schizophrenics’ performance rose 
much less in Period 1 and declined in Period 2. 

The schizophrenic group improved considerably
in performance after rest (a finding
also reported by Kraepelin), whereas the
depressives showed only a slight improvement.
“Improvement scores” were calculated for each
patient by subtracting the last score of Period
1 from the first score of Period 2. Owing to the
wide scatter of schizophrenic scores, there was
no significant difference between the “improvement
scores” of the two groups of Ss
being compared when a t test was used (t =
1.22). However, the nonparametric median
test gave a highly significant chi square between
the two sets of scores (χ² = 8.18,
p < .01). Since all six schizophrenic groups
show a larger rise in level of performance after
rest than do the depressives, the median test
probably gives a valid indication of a true
difference between the groups as far as increment
in performance after rest is concerned.
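
A minimal sketch of the improvement score and of a median test follows. The data are hypothetical and the function names are ours. Two choices are assumptions, since the text does not say how the test was computed: scores equal to the grand median are counted as "not above", and no continuity correction is applied.

```python
from statistics import median


def improvement_score(period1_scores, period2_scores):
    """First score of Period 2 minus last score of Period 1."""
    return period2_scores[0] - period1_scores[-1]


def median_test_chi2(group_a, group_b):
    """Chi square (1 df, uncorrected) for the 2 x 2 above/not-above table."""
    grand = median(group_a + group_b)
    a_above = sum(x > grand for x in group_a)
    b_above = sum(x > grand for x in group_b)
    a_below = len(group_a) - a_above
    b_below = len(group_b) - b_above
    n = len(group_a) + len(group_b)
    den = ((a_above + b_above) * (a_below + b_below)
           * len(group_a) * len(group_b))
    if den == 0:
        raise ValueError("a row or column of the 2 x 2 table is empty")
    return n * (a_above * b_below - b_above * a_below) ** 2 / den


# Hypothetical improvement scores for two small groups.
schizophrenics = [6, 9, 4, 12, 7]
depressives = [1, 2, 0, 3, 5]
chi2 = median_test_chi2(schizophrenics, depressives)
```

Because the median test uses only whether each score lies above the common median, it is insensitive to the wide scatter that weakened the t test.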


DISCUSSION 

The findings of Kraepelin of an immediate 
fall in the performance of schizophrenics have 
not been confirmed in this study, although im- 
provement after a pause and subsequent 
deterioration in performance were shown. The 
initial rise in performance shown here is most 
likely due to the effect of practice on an 
unfamiliar task. That a pilot study pre- 
liminary to the one reported here showed 
marked practice effects on an addition task, 
contrary to Kraepelin, may be because his re- 
sults are expressed as the means of perform- 
ance over 10 days, during which time his 
subjects would become practiced on the task. 
Also, Kraepelin’s Ss, details about whom are 
not given, were possibly of a higher educa- 
tional level. 

In spite of these differences, both the current 
findings and those of Hoch and Kraepelin can 
be subsumed under a theoretical framework 
derived from Pavlov and stated in Hullian (2) 
terms. Pavlov (5) stated that schizophrenics 
were characterized by a “weak” nervous 
system with a predominance of inhibitory 
potential, whereas depressives presumably had 
a predominance of excitatory potential. 

In Hullian terms, schizophrenics might be 
said to show an exaggerated tendency to 
develop reactive inhibition in contrast to 
depressives. In Kraepelin’s experiment, the fall 
in performance could thus be attributed to a 
rapid development of reactive inhibition in a 
familiar task. That is, the change in sE_R is
predominantly due to the development of I_R.
In the present results, the increasing sE_R in
schizophrenic performance may be conceived
as a function of increasing sH_R. The sE_R is,
however, depressed by a development of I_R. In
comparison, the depressives, having a weaker
tendency to develop reactive inhibition,
showed a performance where sE_R was almost
solely a function of increasing sH_R. In both
studies, the increase in performance after rest
could be due to the dissipation of I_R, the
rather more marked improvement with
Kraepelin’s subjects being due to the longer
(five-minute) rest pause which he employed.
Hoch’s results also seem consistent with the 
interpretation of little development of reactive 
inhibition among depressives. 
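
The Hullian quantities used in this argument are commonly related as shown below. This is offered only as a schematic reminder of the standard relations in Hull's system, not as a formula taken from the present study. Conditioned inhibition, written ${}_{S}I_{R}$ here, and the barred symbol for effective reaction potential are not used in the text and are included only as assumptions to complete the usual expression.

```latex
\begin{align}
{}_{S}E_{R} &= D \times {}_{S}H_{R}
  && \text{(reaction potential from drive and habit strength)} \\
{}_{S}\bar{E}_{R} &= {}_{S}E_{R} - \left( I_{R} + {}_{S}I_{R} \right)
  && \text{(effective reaction potential after inhibition)}
\end{align}
```

On this reading, a fall in performance during continuous work corresponds to growth of reactive inhibition $I_{R}$, and the recovery after rest to its dissipation.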

While the findings in these three studies are 
thus susceptible to a theory of differential 
development of reactive inhibition, other ex- 
planations are possible. Kraepelin himself 
dismissed the notion of fatigue as being re- 
sponsible for the decrement in schizophrenic 
performance and favored an explanation in 
terms of diminishing “willed attention” 
(Willenspannung). The present finding of an 
initial improvement in performance would ap- 
pear to run counter to his explanation. 

The superiority in level of performance 
shown by the short-stay over the long-stay 
group of schizophrenics is a not unexpected 
finding which exemplifies the general slowness 
of response presented by most institutionalized 
patients. An explanation of this schizophrenic 
deterioration in terms of lack of drive (D) is 
possible, but like the theoretical interpretation 
presented earlier, must await further experi- 
mentation for its resolution. 




SUMMARY 


When given a repetitive task to perform for 
two successive periods of 10 minutes, non- 
paranoid schizophrenics tended to show little 
improvement in performance with practice 
during the first period, a rise in level of per- 
formance after rest, and a subsequent fall in 
level during the second period. Endogenous 
depressives, on the other hand, showed an im- 
provement in performance throughout both 
periods, but little increase in level of perform- 
ance after rest. Their performance differed 
significantly from that of the schizophrenics in 
these respects. Within the groups of schizo- 
phrenics, there were differences in response rate 
depending on length of stay in hospital. 

The findings support results obtained by 
other workers and are consistent with a theo- 
retical framework stated in terms of differen- 
tial susceptibility to reactive inhibition. 
THE LEARNING AND UTILIZATION OF CONTRAVALUANT MATERIAL¹


EDWARD E. JONES AND JANE ANESHANSEL


Duke University 


DURING the past two decades, there have
been several attempts to investigate
the effects of attitude, set, and frame
of reference on the learning of meaningful prose
material. The general form of these studies has 
involved the comparison by various measures 
of immediate and delayed recall, of groups 
known or presumed to differ in their attitude 
toward the content of the material to be 
learned. In an early study, Laird (5) found that 
optimistic Ss recalled significantly more pleas- 
ant items than did pessimistic Ss. Widely 
separated studies by Clark (2) and Alper and 
Korchin (1) have both explored differences 
between the recall by males and females of 
controversial prose material presumably rele- 
vant to attitudes determined by sex identifica- 
tion. Both studies resulted in differences be- 
tween recall scores of males and females which 
were interpretable in terms of the material 
presented. Watson and Hartmann (9) found 
that theistic and atheistic students tended to 
remember better material supporting their 
“attitudinal frame,” although most of the differences
in their study were not statistically
significant. With a larger group of Ss and a 
more sensitive method of quantitative analy- 
sis, Edwards (4) came to the same general con- 
clusion with regard to Ss differing in their 
attitude to the New Deal. The most familiar 
study in this vein is the clear-cut demonstra- 
tion by Levine and Murphy (6) that prose 
which is congenial either to pro-communists or 
to anti-communists is more rapidly learned and 
more slowly forgotten than prose which pre- 
sents arguments running counter to the S’s 
attitudinal bias. 

In general, considering the difficulties involved
in the reliable assessment of relevant
attitudes, the assumption that we more easily
learn that which is congenial to our own views
appears to have fairly solid empirical support.
While previous authors have generally been
cautious in generalizing their findings to other
attitudes in other learning situations, it has
become convenient to stress the autistic nature
of meaningful learning at the expense of other
motivational effects which may operate to inhibit
or actually reverse the outcome predicted
by a simple autism theory. This paradox has
been recognized by many of the students of
social perception who have explored the polar
mechanisms of perceptual defense and perceptual
vigilance. To resolve this paradox, the
functionalist might suggest that we examine
the total context in which perception (or learning)
takes place, in an effort to determine those
conditions which promote lowered thresholds
for threatening material (perceptual vigilance)
and those which promote heightened thresholds
(perceptual defense). On the basis of our analysis
of these conditions we should be able to
decide whether defense or vigilance will have
greater functional utility in a given situation,
and to predict the corresponding direction in
threshold variation.

1 The authors are indebted to Drs. Gregory Kimble
and Harry I. Kalish for their helpful suggestions and
general facilitation of the research.

If we assume a continuity between percep- 
tion, learning, and retention (and we would be 
in good company in doing so), there must be 
conditions under which the learning counter- 
part of perceptual vigilance will occur. It is 
somewhat surprising that as yet no one has 
thought to explore the conditions under which 
uncongenial or contravaluant material is suc- 
cessfully learned. It is rather easy to think of 
exceptions to the proposition that our minds 
are furnished largely with covaluant ideas. One 
need merely think of the political campaigner 
digging into his opponent’s record, or the in- 
telligence agent plotting the course of Soviet 
diplomacy, or the liberal who quotes Time 
magazine with shocked bitterness, to appreci- 
ate that we do not always surround ourselves 
with ideological yes-men. It is obviously an 
important psychological question, however, 
whether our pains to learn an opponent’s views 
are rewarded with accurate learning and reten- 
tion, and whether the conditions under which 
such learning is facilitated may be specified. 

The present paper attempts to evaluate the 
effectiveness of a particular set of conditions 
in promoting the learning of contravaluant or 
uncongenial prose sentences. In brief outline, 
we propose to test the hypothesis that Ss who 
are presumed to disagree with a set of statements
will actually learn them better than Ss
who agree when retention of the statements 
serves a particular function in a later part of 
the experiment. Specifically, when Ss are told 
that they will be asked to provide appropriate 
counterarguments for prosegregation state- 
ments, prosegregation Ss will learn a set of 
antisegregation statements better than antisegregation
Ss. In keeping with the traditional
argument, however, antisegregation Ss should 
learn more antisegregation statements when 
there is no anticipation of a counterargument 
task. 


METHOD 


Subjects. A 10-item Likert-type scale was devised for 
the experiment as a measure of attitudes toward segre- 
gation. The scale was pretested on 151 introductory 
psychology students and one item was subsequently
modified because it showed a poor correlation with 
total score. The final form of the scale was administered 
to 147 different introductory students and the scores 
ranged from 61 (out of a theoretically possible high of 
70) to 15 (out of a possible low of 10). The distribution 
of scores was approximately normal for both groups. 
While the segregation questionnaires were filled out 
anonymously, the 20 highest and 20 lowest Ss were 
reliably identified by other data and were successfully 
approached to serve as Ss in the apparently unrelated 
experiment. The scores of the prosegregation Ss (hereafter
called “pros”) ranged from 61 to 38; the scores of
the antisegregation Ss (hereafter called “antis”)
ranged from 15 to 22.

Segregation attitude scale. Each of the 10 items of the 
segregation scale was explicitly related to the segrega- 
tion issue. Although the scale itself was unlabeled, no 
special pains were taken to disguise intent because of 
the nature of our sample. The origins of the Duke student
body are geographically diverse (with slightly
over half coming from states in which segregation is 
sanctioned and practiced), the issue of segregation is a 
lively one on the campus, and there is no clearly speci- 
fied college norm which would promote the uniform 
denial of prosegregation statements. Five of the scale 
items were slanted in the prosegregation direction 
(e.g., “Any attempt to deal with segregation ignores the
oft proven maxim that you can’t legislate morals”) and 
five were slanted in the antisegregation direction (e.g., 
“The Negroes’ major concern is with equal educational 
and economic opportunities. They have no intentions 
of interfering with the social patterns of the white 
community”).
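
The scoring of such a scale can be sketched as follows. Several details are assumptions rather than facts from the text: a 7-point response per item (suggested by the stated range of 10 to 70), reverse-keying of the five antisegregation items so that high totals indicate prosegregation attitudes, and the positions of the prosegregation items, which are hypothetical.

```python
# Hypothetical positions (0-based) of the five prosegregation items.
PRO_ITEMS = frozenset({0, 2, 4, 6, 8})


def scale_score(responses, pro_items=PRO_ITEMS, points=7):
    """Total score on a 10-item scale; high means prosegregation.

    Antisegregation items are reverse-keyed (assumed, not stated).
    """
    if len(responses) != 10:
        raise ValueError("expected 10 item responses")
    total = 0
    for i, r in enumerate(responses):
        if not 1 <= r <= points:
            raise ValueError("response outside the rating range")
        total += r if i in pro_items else points + 1 - r
    return total


def extreme_groups(scores, k=20):
    """Return (k highest scorers, k lowest scorers) from {subject: score}."""
    ranked = sorted(scores, key=scores.get)
    return ranked[-k:], ranked[:k]
```

For example, `extreme_groups(scores, k=20)` would pick out the 20 highest and 20 lowest scorers, the selection step described under Subjects.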

The validity of the scale is one of the issues at stake 
in the experiment itself, of course, but there are two 
independent sources of evidence that suggest that the 
scale is sufficiently valid for the purposes of the study. 
The original pretest group of 151 Ss was simultaneously 
administered a combined version of the California 
Ethnocentrism and Anti-Semitism scales (with some 
additional pro-Semitic items included to break any 
response set). This combined scale, the majority of
whose items deal with attitudes towards Jews, correlated
.41 with the segregation scale. Perhaps more revealing,
in terms of validity, is the finding that the
mean segregation-scale score of Northerners (29.7) 
differs significantly (p < .01) from the mean segrega- 
tion score of Southerners (35.4). This contrast is in 
spite of the fact that the general prejudice scores of the 
two groups are indistinguishable (89.5 vs. 89.6). It 
would seem legitimate to assume that the Ss selected 
for the experiment itself had rather consistent beliefs 
either for or against segregation. 

Experimental procedure. Each of the 40 Ss individ- 
ually attempted to learn 11 brief statements which 
were clearly loaded in the direction of arguing against 
segregation. The statements were printed on individual 
cards, and each S was instructed to read each card aloud 
at normal reading speed. After all 11 statements had 
been read in this manner, the S was asked to reproduce 
as many statements as he could remember. This identi- 
cal procedure was repeated for a total of five trials. The 
order of the statements differed randomly on each 
trial, but the order was constant across Ss. All attempts 
at reproduction were transferred verbatim to specially 
devised recording sheets in order to minimize recording 
error. In addition, the experimenter was usually un- 
aware of whether a particular S was pro- or antisegre- 
gation. 
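
One way to obtain statement orders that differ randomly from trial to trial yet stay constant across Ss is to generate them once from a fixed seed. This is only a sketch; the seed value and function name are hypothetical, and the text does not say how the orders were actually drawn.

```python
import random


def trial_orders(n_statements=11, n_trials=5, seed=0):
    """One random ordering of the statements per trial, fixed by the seed."""
    rng = random.Random(seed)
    orders = []
    for _ in range(n_trials):
        order = list(range(1, n_statements + 1))
        rng.shuffle(order)
        orders.append(order)
    return orders


# Generated once and reused for every S.
ORDERS = trial_orders()
```

Holding the orders constant across Ss keeps any serial-position effects the same in all four groups.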

After the five learning trials, each S was presented 
with a list of 11 prosegregation statements, and asked 
to provide appropriate counterarguments for each 
statement as rapidly as possible. Finally, each S was 
asked to match each pro- with each antisegregation 
statement, and told to indicate which of the statements 
came closer to his own beliefs.

While all of the Ss followed this same general pro- 
cedure, half of the Ss (10 “pros” and 10 “antis”) were 
forewarned that they would be asked to provide 
counterarguments for prosegregation statements. These 
Ss will be hereafter designated as the experimental 
group. Since the precise nature of the instructions to 
this group is important in evaluating the experiment, 
they are reproduced below: 

“This is going to be an experiment to see how well 
you can think up appropriate counter-arguments for 
controversial statements. You will be presented with a 
number of statements which all argue more or less in 
favor of segregation. Your task will be to look at each 
statement in turn, read it aloud, and then give me an 
appropriate counter-argument. . . . (They were then
provided with an example of a statement and its
counter-argument.) . . . Before getting on with the main
part of the experiment, I am first going to show you 
some anti-segregation statements like the ones you may 
want to use as counter-arguments. Your first job is to 
learn these statements as quickly and completely as you 
can. Remember, you may want to use some of the 
statements in the main part of the experiment later on, 
so it will pay to learn these statements.” 

The other half of the Ss (the control group) were 
simply instructed to learn the antisegregation state- 
ments. No reference was made to the possible use of 
these statements in a subsequent part of the experiment. 
As indicated above, however, all Ss actually were 
eventually placed in the same position of using the 
statements as counterarguments. 

The following examples illustrate the kind of state- 
ments which the Ss were originally asked to learn. Each 
antisegregation statement is followed by its appropriate
counterargument (as determined by the consensus of
several graduate student judges). The counterargument,
of course, only appeared along with other counterarguments
during the second phase of the
experiment.

1. The issue of Negro-White integrated education 
has nothing to do with racial intermarriage. 

Counterargument: The Negro mulatto is a symbol of 
what would happen after desegregation. 

2. The Negro points up the greatest disparity be- 
tween the theory and our practice of democracy. 

Counterargument: The democratic philosophy in- 
cludes the right to associate with whomever you please. 

No claims need be made for the logical or empirical 
validity of any of these statements since they were 
intentionally stated in somewhat controversial form. 


RESULTS 


The Learning of Controversial Statements 


The major analysis involved scoring for 
accuracy each reproduction attempt for each S 
for each trial. The 11 antisegregation state- 
ments were broken down into 22 idea units and 
these in turn were scored for accuracy in terms 
of the following three categories: 1, the idea
unit is identical to the original, or modified only
in some incidental way (e.g., the S says “because
of” whereas the original reads “a function
of”); 2, the idea unit is identical in meaning to
the original, though it may vary in phraseology,
through the use of synonyms, etc. (e.g.,
the S says “integrating the Negroes with the
whites in education” whereas the original
reads “Negro-White integrated education”);
3, the idea unit clearly conveys a different
meaning than was intended by the original.
Ten randomly selected response protocols (con- 
sisting of 632 idea units) were independently 
scored by a second rater. There was 89 per 
cent agreement on item placement. 
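
The agreement figure is a simple proportion of idea units placed in the same category by both raters. A minimal sketch, with an illustrative function name and made-up codes:

```python
def percent_agreement(rater_a, rater_b):
    """Percentage of idea units given the same category by both raters."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("need two non-empty ratings of equal length")
    matches = sum(a == b for a, b in zip(rater_a, rater_b))
    return 100.0 * matches / len(rater_a)


# Hypothetical category codes (1, 2 or 3) for four idea units.
agreement = percent_agreement([1, 2, 3, 1], [1, 2, 1, 1])
```

Note that raw percent agreement makes no allowance for agreement expected by chance.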

The major hypothesis to be tested is that 
the Experimental “pros” will learn more idea
units than the Experimental “antis,” and the
Control “antis” will learn more idea units than
the Control “pros.” In order to test this hypothesis,
units learned were placed in a 2 (for
replicated Ss) by 2 (for conditions) by 5 (for 
trials) factorial design and, following Lindquist 
(7), a Type III analysis of variance was con- 
ducted. Three separate analyses were made, 
covering respectively the number of units ex- 
actly reproduced (Scoring Category 1), those 
which were reproduced with unchanged mean- 
ing (Category 1 + 2), and total units tried 
(1 + 2 + 3). These three analyses were conducted
in order to test the hypothesis under
investigation at different levels of accuracy. 
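
The three dependent measures can be derived from one list of category codes per S per trial, as in the sketch below. The function name and the example codes are hypothetical.

```python
from collections import Counter


def accuracy_scores(categories):
    """Unit counts at the three levels of accuracy used in the analyses."""
    counts = Counter(categories)
    exact = counts[1]                      # Category 1 only
    meaning = exact + counts[2]            # Categories 1 + 2
    tried = meaning + counts[3]            # Categories 1 + 2 + 3
    return {"1": exact, "1+2": meaning, "1+2+3": tried}


# Hypothetical codes for the idea units one S attempted on one trial.
levels = accuracy_scores([1, 1, 2, 3, 2, 1])
```

Each successive measure includes the one before it, so the three analyses are not independent of one another.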

Turning first to the analysis based on the 
very stringent accuracy criterion of Category
1, the over-all means (combining trials) indicate
a slight tendency toward the predicted
interaction but the F ratio is far from signifi- 
cant. As Table 1 shows, however, there is a 
significant triple interaction between Ss, trials, 
and conditions. Figure 1 reveals that this is a 
function of clear-cut fluctuations in the rela- 
tions between the four cell means from trial to 
trial. Trials 3 and 5 provide the clearest exam- 
ple of the predicted interaction between Ss and 
conditions, but separate single analyses of 
variance for these trials result in moderate but 
nonsignificant interactions. 

When Category 1 is combined with Category 
2 to provide a still meaningful but less strin- 
gent measure of accuracy of reproduction, the 
analysis of variance provides positive confirma- 
tion of the main hypothesis. As Table 1 and 
Fig. 1 show, at every trial Experimental “pros” 
learn more antisegregation units than Experi- 
mental “antis,” and Control “pros” learn less 
than Control “antis.” Except for the marked
improvement in score as a function of trials,
no other interaction or main effect is significant,
and the application of Bartlett’s test shows
that the variances in each cell are homogeneous.
Within conditions, t tests reveal a significant
(p < .05) difference between “pros” and
“antis” under experimental but not under
control conditions.


TABLE 1
LEARNING AT THREE LEVELS OF ACCURACY
(“Pros” vs. “antis” under experimental and control
conditions, analysis of variance)

                        Stringency of Accuracy Criterion
                     Score 1           Score 1 + 2        Score 1 + 2 + 3
Source         df    MS       F        MS       F         MS       F

Within
  C: Trials     4   465.09  112.48**  687.49  214.18**   674.28  156.93**
  A × C               4.84    —         1.73    —                   —
  B × C         4     2.34    —          .63    —
  A × B × C     4    12.45    3.01*


FIG. 1. MEAN UNITS TRIED OR LEARNED AT TWO
LEVELS OF ACCURACY PER TRIAL. PROSEGREGATIONISTS
VS. ANTISEGREGATIONISTS UNDER
EXPERIMENTAL (E) AND CONTROL (C)
CONDITIONS

Since the second analysis of variance resulted 
in a significant interaction, it is only of inci- 
dental importance that the third analysis of 
variance (dealing with the total number of 
units tried) resulted in almost identical F 
ratios for all main effects and interactions, and 
the main effect of trials plus the conditions by 
Ss interactions are again the only significant 
sources of variance. A glance at Fig. 1 tells us 
that Ss seldom responded incorrectly if they 
responded at all, and the fact that the slopes 
indicating the shift in mean as a function of 
conditions are uniformly parallel suggests that 
nothing would have been gained by devising a 
ratio score reflecting the percentage of accurate 
reproductions out of the total tried. In other 
words, reproduction errors were apparently 
distributed randomly across trials, conditions, 
and S groups. 

In general, then, the analyses offer strong 
support for the main hypothesis of the study if
we use a “meaningful” rather than a strictly
verbatim criterion of accuracy. One apparent 
reason why the predicted interaction is signifi- 
cant under the lenient but not the stringent 
criterion of accuracy is that summing Categories 
1 and 2 naturally results in a greater range of 
scores than Category 1 alone, thus providing a 
more reliable quantitative estimate of recall. 
However, a more purely psychological factor 
may also be operating to attenuate the pre- 
dicted interaction under stringent accuracy 
conditions. Doob (3) found that whereas Ss 
tended to recall an equivalent number of con- 
gruent vs. incongruent paragraphs, a percent- 
age measure of recall accuracy based on 
individual statements showed that items with 
which the S agreed were recalled more correctly 
than incongruent statements. Thus it may be 
that strict accuracy of recall is determined 
largely by one’s attitude toward the material 
and is not as subject to modification by variation
in motivating instructions as the total
amount learned as judged by less stringent 
criteria. Nevertheless, it should be pointed out 
that the more lenient (1 + 2) measure of 
accuracy, which results in confirmation of the 
prediction in the present case, is very much a 
measure of “effective accuracy” since the 
meaning of the statement had to be preserved 
without modification to receive a score of 2. 
In any event, conditions may apparently be 
established which can effectively reverse the 
customary conclusion that Ss will show better 
learning of congenial than uncongenial mate- 
rial. It is remotely possible that the present 
results reflect sampling errors in recruiting 10 
Ss for each cell. It may be, for example, that 
the “pros” who ended up in the Experimental 
group were more intelligent, or more rapid and 
retentive readers than the Control “pros.” This 
would not appear to be the case, if we can place 
any confidence in Total ACE scores as a meas- 
ure of intelligence. While the “pros” do differ 
reliably from the “antis” in Total ACE (the 
difference between scores favoring the “antis,”
p < .05), there are no such differences in reading
skill, or in either reading or total scores of
the experimental versus the control “pros.”


The Utilization of Statements as Counterarguments

The primary purpose for presenting prosegregation
statements to the Ss and eliciting
appropriate counterarguments was to fulfill the
conditions of the experiment for the Ss. However,
we thought it was possible that “pros”
and “antis” would differ in the appropriateness
of the argument chosen, and that perhaps the 
“antis” would show a less heavy reliance on the 
antisegregation statements learned. A scoring 
system similar to that used on the learning data 
was applied to the counterarguments chosen by 
the Ss. Five instead of three scoring categories 
were devised in an attempt to gain a more sensi- 
tive measure of variations in the kind of state- 
ment chosen as a counterargument: 

1. entire statement essentially correct; 

2. in spite of changes in phrasing, the mean- 
ing of the statement is correct; 

3. the statement is appropriate but novel 
or creative; 

4. an earlier learned statement is inappropri- 
ately applied; 

5. nonsense statements and garbled versions 
are produced. 

A series of 2 X 2 analyses of variance were 
conducted for each category and for various 
meaningful combinations of them, and the re- 
sults are inconclusive. In eight separate analy- 
ses, not a single main effect or interaction was 
significant. There is a consistent, but never 
significant, tendency for the antisegregationists 
to be more “creative” under experimental con- 
ditions. Aside from this, there is little consist- 
ency in these data. 

When the Ss were permitted to examine both 
lists of statements simultaneously and asked to 
match appropriate counterarguments, there 
were only a few “errors” in the entire sample of 
40 Ss. The fact that 24 out of 40 Ss had com- 
pletely correct matches supports the validity 
of our a priori selection of meaningful opposites. 
Perhaps because of the over-all high level of 
accuracy, there were no significant group differ- 
ences in matching errors. 

The final task for the Ss, it will be remem- 
bered, involved circling the alternative in each 
pair of counterarguments which best reflected 
their attitude. It should be noted that whether 
or not a “pro” indicates a preference for pro- 
segregation statements is a relevant but far 
from decisive test of the validity of the segre- 
gation scale and the appropriateness of the 
statements to be learned. The task of circling 
the preferred statement was completed under 
conditions of surveillance and nonanonymity, 
and furthermore, the statements were generally 
stated in an extreme form, so a clear-cut reflection
of true attitude would not be expected. It
is not surprising, therefore, that the modal 
number of Ss circled 9 antisegregation state- 
ments and 2 prosegregation statements. With 
the exception of the 9 out of 40 Ss in this modal 
group, however, there is practically no overlap 
between “antis” and “pros.” Application of
the nonparametric Median Test results in a 
highly significant difference between prosegre- 
gation and antisegregation Ss in the relative 
tendency to check prosegregation statements. 
The fact that this difference appeared is en- 
couraging evidence for the validity of the atti- 
tude scale and also for our assumption that the 
statements would be relatively more congenial 
to one group than to the other. 


DISCUSSION 


The results of the present experiment indi- 
cate that under certain conditions, Ss who
disagree with a set of statements will actually 
learn them better than Ss who agree with the 
statements. This conclusion complicates the 
picture presented by those who favor consistent 
explanations in terms of need-relevant per- 
ceptual and mnemonic selectivity. The tradi- 
tional interpretations of meaningful learning 
which stress autism, perceptual selectivity, or 
congruence with the learner’s frame of refer- 
ence, receive support under the standard con- 
trol conditions of the present experiment, but 
a rather slight modification in experimental 
instructions is sufficient to reverse the conven- 
tionally predicted relationship. It would 
appear, then, that we tend to be autistic in the 
retention of congenial material under condi- 
tions of low motivation and circumscribed time 
span. When the stakes are raised and the con- 
text of behavior is enlarged by increasing the 
instrumental significance of learning contra- 
valuant material, something like vigilance 
appears. For the prosegregationist under the 
experimental conditions, the comfort of sur- 
rounding himself with congenial ideas is pitted 
against the comfort of performing well the 
stated experimental task. In this particular 
instance the latter factor won out. When the 
contravaluant material is given a new utility 
for the learner, the wish that is otherwise 
autistically pampered is supplanted by more 
powerful motives of ego-enhancement (sub- 
suming implicit competition with other Ss as 
well as the hope of a kind word of reward from
the experimenter). While this interpretation
supplements the restricted theories of autism
and covaluant selectivity, it neither supplants 
nor contradicts such theories. In particular, the 
present results support the functionalistic or 
pragmatic notion that we learn best that which 
is instrumental to adaptation and survival. 

The interpretation outlined above would 
hold equally well if the predicted differences 
observed between “pros” and “antis” in the 
control group were merely eliminated under 
the experimental instructions. But why should 
the “pros” actually surpass the “antis” in the 
learning of antisegregation material? The clue 
to this paradox, we suggest, lies in the general 
contention that different individuals can be 
differentially motivated by the same set of in-
structions. We assume that both “pros” and
“antis” are about equally motivated to do well 
in the experiment—i.e., they both want to pre-
pare themselves for the task of countering pro-
segregation arguments with appropriate anti-
segregation statements. While the “antis”
presumably come trained and primed for such 
a task, the “pros” face the prospect of adducing
contravaluant arguments like a nervous under- 
study rushed on stage to play an unfamiliar 
character type. Since the “anti” comes to the 
experiment more confidently familiar with the 
arguments which favor his cause, he is less 
inclined to view the initial learning task as a 
critical prerequisite for the fulfillment of the 
main experimental requirement. The “pro,” 
on the other hand, is forced to rely on rote 
learning in order to comply with the experi- 
mental task and therefore he does a better job 
in memorizing the presented antisegregation 
statements. There is some empirical support for 
this otherwise plausible interpretation in the 
fact that the “antis” did tend to be more “crea-
tive” in their final rebuttals than the “pros,”
but this tendency was neither striking nor 
statistically reliable. 

If we had introduced a third condition in 
which an extrinsic monetary reward was prom- 
ised for successful learning, the same results 
would not be predicted. Assuming that the 
proposed monetary reinforcement was sufficient 
to raise the level of motivation to a high level 
in all Ss, we would predict an absence of any 
differences in learning between the two attitu- 
dinal groups—or rather, those differences that 
did exist would be a function of ability ceiling 
and not attitude. The reversal effect obtained 
in the present experiment is dependent on more
subtle reinforcement potentials intrinsically 
relevant to the task at hand. What we appar- 
ently have in the present case is the operation 
of self-instructions or covert sets (8) which in- 
volve different interpretations of the overt 
instructions. The “pros” and the “antis” are 
both motivated to attain the same goal, 
but they tell themselves different ways to 
achieve it. 


SUMMARY 


The assumption that we tend to learn state- 
ments that we agree with better than state- 
ments with which we disagree must be placed in 
the broader context of the learner’s over-all 
purpose in the task. An experiment was con- 
ducted in which prosegregation and antisegrega- 
tion Ss were asked to learn a series of antisegre- 
gation statements. Half of the Ss proceeded 
under the assumption that these same state- 
ments could well be used as counterarguments 
for prosegregation statements to be presented 
in a subsequent portion of the experiment. The 
remainder of the Ss were simply asked to learn 
the statements as well as possible. The results 
demonstrate quite conclusively that prosegrega- 
tionists will learn antisegregation statements 
better than antisegregationists when a subse- 
quent debate is anticipated. Without such an 
anticipation, however, the customary finding 
holds—i.e., the antisegregationists make better 
progress than the prosegregationists in learning 
the congenial material. No significant differ- 
ences were found with regard to the utilization 
of antisegregation counterarguments in the 
debating task. The results were interpreted in 
terms of the differential availability of means 
to complete the experimental task. Since the 
antisegregationists are initially better prepared 
to fulfill the ultimate experimental requirement, 
they are less highly motivated to learn the con-
genial statements when a debate is anticipated.
AUTHORITARIANISM AND REPRESSION¹


NATHAN KOGAN 


Harvard University


RECENT years have witnessed an in-
creasing number of studies experi-
mentally confirming many of the
propositions set forth in The Authoritarian
Personality (1). With few exceptions, investi-
gators have focused upon the cognitive styles of
ethnocentric and authoritarian individuals
(2, 3, 4, 7, 8). Generally overlooked is the
tendency to repress, one of the most basic 
features asserted to be typical of authoritarian 
personalities. According to Adorno, et al. (1),
the highly authoritarian individual does not 
become aware of undesirable or hostile feelings 
toward parental figures. Feelings of inner 
weakness or dependency are similarly denied. 
The nonauthoritarian individual, according to 
these authors, is more likely to be an intellectu- 
alizer. He freely admits into awareness ambiva- 
lent feelings about parental figures and, in 
addition, conflicts in the sexual and aggressive 
areas are subject to conscious reflection with 
consequent guilt feelings. 

Hyman and Sheatsley (5) in a methodologi- 
cal critique of The Authoritarian Personality 
have cast considerable doubt upon the data 
from which the presence of repression was in- 
ferred. An experimental examination of the 
relationship between repression and authori-
tarian-ethnocentric attitudes is, therefore, of
particular relevance at the present time.
Given the postulated link between repression
and authoritarianism, the hypothesis is tested
that recognition accuracy for stimuli consisting 
of disparaging aggressive and sexual references 
to parents and self is inversely related to the 
strength of authoritarian and ethnocentric at- 
titudes. 


1 The present research is based upon portions of a 
Ph.D. thesis submitted to the Department of Social 
Relations, Harvard University. I am especially in- 
debted to Professors G. W. Allport and J. S. Bruner 
for their invaluable suggestions and criticisms. Dr. 
R. Tagiuri read the manuscript and contributed 
many helpful comments. I am grateful for the assist- 
ance rendered by Mr. M. B. Schlank, who served as 
the independent judge. This research was facilitated 
by a Sigmund Livingston Fellowship awarded under 
the auspices of the Anti-Defamation League. 




METHOD 


In testing the hypothesis a technique first employed 
by Lazarus, Eriksen, and Fonda (6) was used. Forty- 
five sentences partially masked by white noise were 
recorded on tape. The first three sentences, all emo- 
tionally neutral in content, were practice items. There 
were 42 experimental sentences—14 neutral, 14 sexual, 
and 14 aggressive. Sexual and aggressive content was 
either directed against the self in the form of expres- 
sions of guilt and personal inadequacy, or against 
parents or people in general. Neutral sentences were 
included to take into account individual differences in 
auditory acuity. The three major types of sentences 
were randomized in presentation. Two examples of 
each—neutral, aggressive, and sexual, respectively— 
are presented below.²

“You like to go swimming during the summer
season.

“The thought of a good meal increases your ap-
petite.

“Only your death would be a just punishment for 
you. 

“Your mother is to blame for your worst faults. 

“You have been unable to break your ugly sex 
habits. 

“You would have loved to share your father’s sex 
life.” 

The auditory perception test was group adminis- 
tered. A tape recorder was situated in one corner of 
the experimental room and connected to a speaker
placed in a prominent position approximately equidis- 
tant from all Ss. The instructions were recorded on 
tape and played over the speaker prior to the presenta- 
tion of the first sentence. These instructions were as 
follows: 

“You are now hearing my voice as recorded on a 
tape recorder. You should be able to hear me quite 
clearly. In just a moment I shall introduce a noise
background along with my voice, and the experiment 
will begin. Do not hesitate to write down everything 
you hear regardless of how shocking or improbable it 
may sound. Even though you may not hear the entire 
sentence, try to make a meaningful sentence out of 
what you do hear. Guess when you are not sure. We 
will now begin.” 

The Ss were encouraged to make meaningful sen- 
tences to reduce the prospect of nonsense responses. 
Warning Ss of the shocking nature of the sentences 
was intended to discourage withholding of written 
responses to accurately perceived stimuli. 

The first three sentences were practice items. The 
E presented each of these sentences, with Ss writing 
down what they heard. Then, E read the sentence 
aloud in its original form and replayed it again on 
tape (with noise background). The Ss could recognize 


2 A copy of the complete sentence list can be ob-
tained from the author. 
the sentence despite the noise after having been in-
formed of its content. All Ss were thus convinced that
they were not hearing meaningless sounds. 

The 42 experimental sentences followed, the re- 
cording being stopped after each sentence to allow Ss 
to write down what they heard. The Ss were constantly 
cautioned not to look at each other’s papers. Strict 
conformity to this rule was urged, since checking of 
responses among Ss would, in view of the emotional 
quality of the stimuli, obviously lead to inhibition of 
response. 

All sentences received a quantitative score ranging 
from 0 to 4. The rating scale is described below: 

4—perfect recognition.

3—almost perfect recognition, minor errors or
omissions.

2—basic meaning of sentence recognized, with
distortions or omissions in wording.

1—meaning of sentence not recognized, crucial
omissions or distortions in wording, though at least
half of the words in the sentence are used.

0—a blank, or mere stub of sentence perceived
correctly; also a completely irrelevant response.

The individual sentence ratings were summated for 
the three types of sentences. To equalize the auditory 
acuity factor, a sex/neutral and an aggressive/neutral 
ratio was computed for each S. 
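
The summation and ratio step just described can be sketched as follows. This is only an illustrative sketch: the function name `recognition_ratios` and the sample ratings are hypothetical and are not data from the study. It assumes each S's 0 to 4 ratings are grouped by sentence type.

```python
def recognition_ratios(ratings):
    """Sum the 0-4 ratings per sentence type and divide by the neutral sum.

    `ratings` maps "neutral", "sexual", and "aggressive" to lists of ratings.
    """
    totals = {kind: sum(scores) for kind, scores in ratings.items()}
    neutral = totals["neutral"]
    if neutral == 0:
        # A zero neutral total leaves both ratios undefined.
        raise ValueError("neutral total is zero; ratios are undefined")
    return {
        "sex/neutral": totals["sexual"] / neutral,
        "aggressive/neutral": totals["aggressive"] / neutral,
    }


# Hypothetical S with 14 ratings per sentence type (not data from the paper).
example = {
    "neutral": [4, 3, 4, 2, 4, 3, 4, 4, 2, 3, 4, 3, 4, 4],     # sums to 48
    "sexual": [2, 1, 0, 3, 2, 1, 2, 0, 1, 2, 3, 1, 2, 4],      # sums to 24
    "aggressive": [3, 2, 1, 3, 2, 4, 2, 1, 3, 2, 3, 2, 4, 4],  # sums to 36
}
ratios = recognition_ratios(example)
```

Dividing by the neutral total is what lets an S with poor hearing be compared with an S with good hearing: a ratio below 1 means the emotional sentences were recognized less accurately than the neutral ones for that same S.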

The investigator scored all of the sentences for all
Ss. To guard against the possible influence of rating
bias, the sentence protocols were scored before the
determination of the attitude scores. An advanced
graduate student in clinical psychology also scored
all of the sentences for Ss in the upper and lower
quarters on the E and F scales which had been previ-
ously administered (see below). The scoring key above
was given to the judge, and illustrative examples were
provided for each of the scoring categories. In no case
did an illustrative example correspond to a sentence
response in the data. The sex/neutral and aggressive/
neutral ratios yielded Spearman rank-order correla-
tion coefficients of .90 and .93, respectively, between
the investigator and the independent judge.

The range of individual differences in auditory
acuity for the present type of stimulus material would
seem to be quite wide. Pretest Ss were able correctly
to perceive, on the average, about one out of every
three sentences presented. A small number, however,
obtained a zero score, i.e., perceived none or very
few of the words in any of the sentences. In the course
of assigning quantitative scores to the experimental
Ss, it became apparent that a considerable number of
Ss failed to recognize enough of the stimulus material
to yield a protocol of any value for quantitative
scoring. A score of 10 assigned by each judge was
therefore established as a minimum for the retention
of a case. It should be noted that the reliability coeffi-
cients reported above are based upon Ss whose scores
are above this arbitrary cutting point.

Consider next the middle range Ss whose sentence
protocols were scored only by the investigator. Again,
a score of 10 served as a cutting point, but the selection
or elimination of an S rested exclusively upon the
investigator’s judgment. No great risk was involved,
however, in view of the extent of agreement between
the judges for upper and lower quartile Ss. Thus, the
mean score for retained Ss was 30.5 for the investigator
and 31.6 for the independent judge. The mean dif-
ference yields an SD of 2.8. Since the lowest acceptable
score among middle range Ss was 16, it is highly un-
likely that any of them would have been eliminated
had their sentence protocols been scored by the inde-
pendent judge.

Of the 80 Ss—white Christian male freshmen at a
large New England university—who participated in
the perception study, 37 satisfied the criterion of
recognition. The present report is based upon data
yielded by these 37 Ss.

The Ss had filled out a 30-item attitude battery
prior to their participation in the perception experi-
ment. The California E and F scales each furnished 12
items.³ Since all of these items are worded in the same
direction—agreement with the item is scored as
authoritarian or ethnocentric—five items of opposite
tendency were included to avoid the possible develop-
ment of a negative response set in the low group. They
were not scored.

The Ss could assign one of six ratings to each item
ranging from +3 (very strong agreement) to -3
(very strong disagreement). These raw values were
converted into scores ranging from 1 to 7, with a
score of 4 assigned to the rare case of an omitted item.
An S’s score on E and F was obtained by computing
the average of the converted ratings assigned to the
twelve items selected from the respective scales.

RESULTS


Product moment r’s⁴ are -.39 between F
scores and sex/neutral ratios and -.38

3 The writer wishes to thank Dr. D. J. Levinson
for his assistance in the selection of items. Timeliness 
and satisfactory Discriminatory Power values dictated 
the choice of items. In addition, there was at least 
one item representing each aspect or subsyndrome of 
ethnocentric and authoritarian ideology. Eleven E- 
Scale items were taken from Table 19, p. 142 of The 
Authoritarian Personality—1, 2, and 3 of subscale A; 
1, 2, and 6 of subscale B; and 2, 4, 6, 7, and 8 of sub- 
scale C. A sixth “patriotism” item—“The first thing 
foreigners in this country should learn is the true 
American way of thinking and acting’”—was sub- 
stituted for less timely items in subscale C to make a 
12-item scale. Eleven F-Scale items were selected 
from Forms 45 and 40, pp. 255-257—4, 8, 9, 13, 18, 
21, 25, 26, 34, 37, and 42. The twelfth item, a revision 
of Item 32 in Form 78, p. 226, rounded out the scale: 
“The best teacher or boss is the one who tells us just 
exactly what is to be done and how to go about it.” 
Both new items have yielded satisfactory Discrimina- 
tory Power values in administrations of the E and F 
scales subsequent to the publication of The Authori- 
tarian Personality. 

4 Product moment r’s are justified in the present
case, for the sample distributions of ratios were es- 
sentially normal. The mathematical property of ratios 
is that they approach zero as the numerator decreases 
and approach infinity as the denominator decreases. 
Hence, a markedly skewed population distribution 
would be the statistical expectation. Since a minority 
TABLE 1
MEAN RECOGNITION ACCURACY SCORES FOR Ss IN
THE EXTREME QUARTERS OF THE F-SCORE
DISTRIBUTION

                      Judge A*              Judge B*
Sentence Type     Lower Q   Upper Q     Lower Q      Upper Q
Neutral             .95       .88         .97          .95
Sexual              .82       .37         .82          .43
Aggressive          .86       .46      [illegible]     .46

Note.—N = 11 for both upper and lower quarters.
* Judge A is the investigator and Judge B is the independent
judge.





between F scores and aggressive/neutral ratios. 
These values are significant beyond the .01 
and .02 levels (one-tailed test), respectively. 
The correlation coefficients are negative since 
stronger authoritarian attitudes are associated 
with lower ratios, i.e., less accurate recognition 
of the emotional sentences. Product moment 
r’s between E scores and sex/neutral and 
aggressive/neutral ratios are -.25 and -.09,
respectively. Neither of these values achieves 
statistical significance. The discrepant results 
for the two scales receive further consideration 
below. 
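
As a rough check on the significance levels reported for the F-scale correlations, a product-moment r can be converted to a t statistic with N - 2 degrees of freedom. The sketch below assumes the conventional formula t = r * sqrt(N - 2) / sqrt(1 - r^2) and uses N = 37 from the text; the source does not state which test the author actually applied, and the function name `t_from_r` is hypothetical.

```python
import math


def t_from_r(r, n):
    """t statistic for testing a Pearson r against zero (df = n - 2)."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r * r)


# Correlations reported in the text for the 37 retained Ss.
t_sex = t_from_r(-0.39, 37)  # F score vs. sex/neutral ratio
t_agg = t_from_r(-0.38, 37)  # F score vs. aggressive/neutral ratio
```

With 35 degrees of freedom the tabled one-tailed .01 critical value is roughly 2.44. The first statistic exceeds it in absolute size and the second falls just short, which agrees with the reported ".01 and .02 levels, respectively."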

One possible interpretation for the superior 
perceptual performance of Ss scoring low on the 
F scale is that these individuals are more 
intelligent and thus more likely to recognize the 
unusual content of the emotional sentences. 
Fortunately, a score for “Language” IQ de- 
rived from the California Test of Mental 
Maturity was available for 31 of the 37 Ss. 
When this variable is held constant, the par-
tial correlation with F score is -.32 for sex/
neutral ratios and -.35 for aggressive/neutral
ratios. These values are significant at the .05 
level (one-tailed test) in spite of the smaller 
number of cases. Thus, differences in per- 
ceptual performance cannot be attributed to 
differences in verbal aptitude. 
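
Holding IQ constant corresponds to a first-order partial correlation. The sketch below uses the standard formula; the zero-order correlations involving IQ are not reported in the text, so the input values are purely hypothetical placeholders and the result is not meant to reproduce the published figures.

```python
import math


def partial_r(r_xy, r_xz, r_yz):
    """Correlation of x and y with z held constant (first-order partial r)."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Hypothetical inputs: x = F score, y = sex/neutral ratio, z = "Language" IQ.
r_partial = partial_r(r_xy=-0.39, r_xz=-0.30, r_yz=0.25)
```

When IQ is unrelated to both variables the partial r equals the zero-order r. The more IQ correlates negatively with F score and positively with recognition accuracy, the more the partial r shrinks toward zero.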

Of the 37 Ss satisfying the criterion of recog-
nition, eleven were in the upper quarter and
eleven in the lower quarter of the F-score 
distribution. This finding indicates the lack of 
any basic acuity difference between the more 
and less authoritarian. Table 1 supports this 





of Ss, however, showed superior recognition for the 
emotional stimuli, and then only to a very slight de- 
gree, the sample skewness is reduced drastically from 
what would be expected if ratios were formed by draw- 
ing random numbers. 
contention. At auditory levels at which both
upper and lower quarter Ss are able to identify 
neutral material, only the lower quarter Ss 
seem able to approximate this performance 
with the more emotional stimuli. The results 
for both judges achieve a high degree of 
agreement. 

A comparison of the recognition accuracy 
for sexual and aggressive stimuli of individuals 
in the upper and lower quarter of the F dis- 
tribution yields critical ratios with one-tailed 
probability values less than .05, based on the 
nonparametric Wilcoxon T test (10) applied
to the sex/neutral and aggressive/neutral 
ratios. This level of significance was obtained 
when independent analyses were performed on 
data scored by the writer and by the inde- 
pendent judge. A comparable analysis of 
extremes on the E scale yielded nonsignificant 
critical ratios for both judges. These results 
corroborate the outcome of the correlational 
analysis and also serve as an indirect check on 
interjudge reliability. 


DISCUSSION 


The results clearly indicate the presence of 
an inverse relationship between authoritarian 
attitudes and the capacity to identify highly 
emotional stimulus sentences. While both 
high and low authoritarians perceive the emo- 
tional stimuli less accurately than the neutral 
material, the discrepancy is considerably 
greater in the case of the more authoritarian 
individuals. 

Adorno, et al. (1) describe authoritarian 
individuals as repressers. According to these 
authors, such individuals demonstrate a lack of 
conscious ambivalence of a hostile or sexual 
nature toward parental figures and other per- 
sons (outgroup members excluded) and deny 
any manifestations of weakness or passivity 
relative to the self. If it is assumed that re- 
pression has perceptual consequences, the 
marked inability of the authoritarians to iden- 
tify the sexual and aggressive sentences is 
experimental evidence in favor of the hypothe- 
sized relationship between repression and 
authoritarianism. 

The hypothesis relating nonauthoritarianism 
and intellectualization is confronted with 
equivocal evidence in the present experimental 
findings. The nonauthoritarian Ss do not 
demonstrate superior recognition accuracy for 
the emotional stimuli when neutral sentences 
serve as a baseline. In this sense, these Ss
can be distinguished from Lazarus, Eriksen, 
and Fonda’s (6) intellectualizing neurotics who 
perceived emotional stimuli more accurately 
than neutral stimuli. Of course, the latter set 
of results may be applicable only to pathologi- 
cal forms of intellectualization. White (9) in 
describing intellectualizing processes in ob- 
sessive neurotics refers to the detachment of 
sexual and aggressive impulses from the ego. 
This observation is in sharp contrast to the 
California investigators’ description of the 
“ego-assimilated” nature of sexual and 
aggressive impulses in nonauthoritarians. 
Whereas intellectualization, in their concep- 
tion, is an attempt to arrive at a genuine 
understanding of one’s problems, neurotic 
intellectualization implies a shift of one’s 
problems from a personal-emotional to an 
abstract impersonal plane. 

This marked disparity between pathological 
intellectualization and the ego-defenses of non- 
authoritarians calls into question the applica- 
bility of Lazarus, Eriksen, and Fonda’s 
theoretical interpretation to the present 
findings. These investigators maintained that 
the emotional stimuli were equally threatening 
to their repressers and intellectualizers, but 
were perceived differently because of the par- 
ticular ego-defense employed. In the present 
case, it seems much more probable that the 
distinctive personalities of authoritarians and 
nonauthoritarians make for differences in the 
subjective acceptability or “threateningness”
of the stimuli. Nonauthoritarians who are 
allegedly capable of accepting their own sexual 
and aggressive impulses should perceive stimuli 
embodying such impulses as less threatening. 

The degree of confidence that attaches to a 
repression interpretation would be considerably 
increased if an independent definition of de- 
fense mechanism were available. The lack of 
such a definition necessarily implies that the 
empirical findings do not provide a conclusive 
test of the hypothesized relationship between 
repression and authoritarianism. 

The lack of positive results for the E scale 
deserves further consideration. Since the E 
scale measures ethnocentrism and prejudice, 
it is obviously less closely related to the au- 
thoritarian syndrome than is the F scale. 
Correlations between E and F are generally in 
the vicinity of .6 to .7. Given the fact that 
repression is explicitly an hypothesized compo- 
nent of “the authoritarian personality” and 


only indirectly a component of ethnocentrism 
and prejudice, the discrepancy between E and 
F in the extent to which each is correlated with 
perceptual performance is not surprising. 


SUMMARY 


In an examination of the relationship between
repression and authoritarian and ethnocentric 
attitudes, a group of male freshmen college 
students filled out the California E and F 
scales and then participated in an auditory 
perception experiment in which they were 
asked to identify a series of noise-masked 
self-disparaging sexual and aggressive sen- 
tences and neutral control sentences. The F 
scores were significantly related to the percep- 
tion of sentences in both the sexual and 
aggressive areas with the more authoritarian 
men demonstrating poorer recognition ac- 
curacy. The E scale, on the other hand, did not 
prove to be related to perceptual performance. 
The interpretation of the experimental results 
stressed the greater “threateningness” of the 
stimuli for the authoritarians. Differences in 
perceived threat were, in turn, attributed to the 
manifestation of stronger repressive tendencies 
in the authoritarian Ss. 
THE AFFILIATION MOTIVE AND PERCEPTUAL SENSITIVITY
TO FACES 


JOHN W. ATKINSON¹ AND EDWARD L. WALKER


University of Michigan 


THE present investigation is an attempt
to predict from imaginative measures
of the affiliation motive (n Affiliation)
the frequency with which a subject (S) selects 
human faces from similar but nonhuman 
figures in a perceptual task. It was undertaken 
with several objectives in mind. The sen- 
sitizing effect of motives on perception has 
been demonstrated in numerous studies (7), 
but in many instances the results have ad- 
mitted reasonable alternative interpretations. 
For example, McClelland and Liberman 
(6; 5, p. 257) have reported a study in which n 
Achievement scores obtained from imagina- 
tive (TAT) stories proved to be related to 
recognition thresholds for achievement-re-
lated words. Farber (4), in a careful analysis
of the role of motivation in verbal learning
and performance, points out quite correctly
that these results can be interpreted as a
function of verbal habits common to the
imaginative story performance and the word-
threshold measuring situation. It was hoped
that in the present study we could examine
the relationship between a verbal-imaginative
measure of motivation and sensitivity to
relevant stimuli in perception in a situation
which would not admit Farber’s alternative
interpretation or explanation in terms of the
word-frequency variable discussed by Solomon
and Howes (8).

The second objective was to obtain further 
evidence of the validity of the method of 
scoring imaginative (TAT) stories designed to 
measure motivation for social acceptance or n 
Affiliation (2). The demonstration of a positive 
relationship between n-Affiliation scores ob- 
tained from imaginative stories and perceptual 
sensitivity to stimuli related to affiliation would 
be further evidence that a motivational dispo- 
sition is being measured. 

The senior investigator has discussed else- 
where (1, p. 90) the theoretical basis for an 

1 This research was conducted during the period of 
a Social Science Research Council Faculty Research 
Fellowship. 


expectation that when the motive is meas- 
ured in a neutral situation (i.e., one in which no 
special operation has been performed either to 
arouse the motive or to create an especially 
relaxed condition), a relationship can be ex- 
pected with dependent perceptual and per- 
formance variables only when arousal opera- 
tions are performed immediately before the S 
begins the criterion task. A recent study by 
Moulton et al. (1) has found a clear inverse
relationship between n-Achievement scores and 
recognition thresholds for success and failure 
words when the motive was aroused before the 
dependent variable performance, but a similar 
relationship of lesser magnitude with neutral 
conditions preceding the perceptual task. 
The difference between the two conditions was 
statistically significant, however, only in the 
case of failure words. 

In the experiments to be reported, the 
strength of the affiliation motive was meas- 
ured by having several groups of male college 
students write imaginative stories under nor- 
mal (neutral) classroom conditions. These 
groups were subsequently subjected to pro- 
cedures designed to measure perceptual 
sensitivity to affiliation-related stimuli. Some 
groups performed the perceptual task under 
affiliation-neutral conditions; other groups 
were exposed to procedures designed to arouse 
the affiliation motive prior to the measure- 
ment of perceptual sensitivity for affiliation- 
related stimuli. 


METHOD 


Subjects. The Ss were 93 male undergraduates who 
at the time of their participation in the experiments 
were members of an introductory experimental psy- 
chology course at the University of Michigan. They 
constituted nearly the entire population of men in 
three semesters of this course in the years 1952, 1954, 
and 1955. 

Measurement of motive strength. In each class from 
four to six pictures were used to elicit imaginative 
stories which were scored for affiliation-related im- 
agery to obtain measures of individual differences in 
strength of n Affiliation. The instructions used and 
evidence of the validity of the scoring system may be
found elsewhere (2). Stories were scored by J. W.
Atkinson whose scoring reliability is above .90 (2).
The affiliation motive is best defined in terms of the
content of imagery scored in the imaginative stories,
the essence of which is concern over establishing,
maintaining, or restoring positive affective relations
with other persons.

Experimental conditions. Approximately a month
after the motive measures were obtained, the groups
were introduced to the perceptual task. One whole
class was introduced to this task with no immediately
preceding attempt to arouse the affiliation motive. A
second class participated in a sociometric rating pro-
cedure immediately before being introduced to the
perceptual task. In this procedure, while all Ss were
seated together around a large room, each S was re-
quired to place before him a card bearing a code num-
ber. Then he was required to make a series of ratings
of his classmates of more than a month’s acquaint-
anceship. First they were asked to rank-order a list of
traits according to the degree to which possession of
the trait would make a person likable. Then each was
asked to pick from the list the two traits which best
described each other person in the room. Finally, each
was asked to write down the code numbers of the three
persons whom he would most like to have as close
personal friends. This task took about 20 minutes. In
another year, half of the class was run under the
neutral condition and the other half under a similar
aroused condition. In an effort to keep the sociometric
rating task and the perceptual task close together in
time without establishing an apparent relationship
between them, the perceptual task was introduced
immediately without the usual break in the two-hour
laboratory period. The excuse given was that time
was running out and that there was one other task to
be completed within the regular laboratory period. It
was implied that the two tasks were unrelated.

The perceptual task. The Ss were randomly seated
before a screen placed in the front of the room with
their backs to a slide projector with a camera-type
exposure diaphragm located at the back of the room.
They were told that pictures would be projected on
the screen before them at a speed and illumination
inadequate for clear recognition. They were told that
a picture would appear in each of four quadrants
(top, bottom, left, and right), and though the image
they received was very fuzzy and inadequate for rec-
ognition, they would be asked to designate in which
of the four quadrants the picture stood out the most
or was most clear to them. They were asked to write
down a symbol for the appropriate quadrant after
each of 80 exposure trials.

The psychophysical method, adopted from Black-
well (3), is suitable for measuring discrimination
without conscious awareness. Blackwell has reported
detection thresholds of a highly reliable character ob-
tained with this method when the S was quite positive
that no stimulus difference actually existed in the
situation. In the present experiment, the exposure
interval and level of screen illumination were both
deliberately set at very low values on an empirical
basis in pretest trials with other Ss, so that nothing
more than a faint blur could be seen by the Ss. Many
Ss refused to believe that there was any picture on
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the screen during the exposure interval. Subsequent 
questioning of the Ss failed to reveal any who could 
give an adequate description of the composition of 
the exposure stimuli. For the most part, it can then 
be assumed that most Ss were performing their task 
at a perceptual level somewhat below the recognition 
threshold 

The stimuli. Each slide had four figures somewhat
removed from the fixation point and equal distances
up, down, left, and right from it. On every slide, one
figure was that of a single face or several faces, and
the other three figures were presumably neutral in
the sense that they were not stimuli related to n
Affiliation. Most of the neutral stimuli were pictures
of home furnishings, e.g., a lamp or a plate about the
size of a face, and in so far as possible had perceptual
properties roughly equivalent to the pictures
of faces.

There was a total of five face pictures and 15 neutral
pictures. Each face-picture appeared on four
slides, once in each available position, and neutral
stimuli appeared at random in the other three positions.
This arrangement produced five sets of four
slides each, for a total of 20 slides. The 20 slides were
presented in a predetermined random order, then in a
second and independently selected random order for a
total of 40 trials. These two orders were repeated to
make a total of 80 trials. Each exposure was preceded
by a verbal signal, “Ready now,” in order to permit
the Ss to focus on the fixation point in the middle of
the screen.

Treatment of the data. The conditions under which
the perceptual task was executed varied somewhat
from group to group because the experiment was
carried out in a classroom in which adequate control
of ambient illumination could not be exercised. For
this reason each experimental group was treated
separately. For each S, the number of times he selected
the face or faces in 80 trials was determined. Z
scores were obtained for each of these distributions
and it was then possible to place together groups which
experienced slightly different conditions.

The n-Affiliation score in different groups was obtained
from slightly different sets of pictures and from
either four or six pictures. For this reason the n-Affiliation
scores for each condition-homogeneous
group were broken into High (above the median) and
Low (below the median) n-Affiliation groups, and also
into High, Middle, and Low (thirds) n-Affiliation
groups.
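The within-group standardization and median split described under Treatment of the data can be sketched as follows. This is only an illustration: the group names and all data values are hypothetical, the population standard deviation is assumed for the Z scores (the report does not say which form was used), and scores falling exactly at the median are labeled as ties, since the report does not say how they were handled.

```python
from statistics import mean, median, pstdev


def z_scores(counts):
    """Standardize one group's face-choice counts (out of 80 trials).

    Assumption: the population standard deviation is used.
    """
    m = mean(counts)
    sd = pstdev(counts)
    return [(c - m) / sd for c in counts]


def median_split(scores):
    """Label each score High (above the group median) or Low (below it)."""
    med = median(scores)
    return ["High" if s > med else "Low" if s < med else "Tie" for s in scores]


# Hypothetical data: (face choices in 80 trials, n-Affiliation score) per S.
groups = {
    "class_A": [(18, 3), (22, 7), (25, 9), (19, 4), (21, 6), (27, 8)],
    "class_B": [(30, 2), (35, 5), (28, 1), (33, 6), (36, 8), (31, 4)],
}

# Standardize and split inside each group, then pool across groups.
pooled = []
for name, rows in groups.items():
    counts = [c for c, _ in rows]
    motive = [m for _, m in rows]
    pooled.extend(zip(z_scores(counts), median_split(motive)))

high = [z for z, label in pooled if label == "High"]
low = [z for z, label in pooled if label == "Low"]
print(round(mean(high), 2), round(mean(low), 2))
```

Because each group is standardized on its own distribution, a group tested under brighter room light (and so with more face choices overall) contributes Z scores on the same footing as any other group.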


RESULTS 


The primary result may be seen in Table 1. 
Those Ss who are high in n Affiliation tend to 
choose faces more frequently than those Ss 
who are low in n Affiliation. The difference is 
significant at the .02 level. Although not 
reported here in tabular form, a similar 
difference at approximately the same signifi- 
cance level appears when the group is divided 
into High, Middle, and Low on n Affiliation. 
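As a rough check on the primary result, the t ratio can be recomputed from the summary figures in Table 1, reading the entries as means of .20 and -.23, standard deviations of 1.01 and .94, and Ns of 50 and 43. The standard-error formula below, with N - 1 in each denominator, is an assumption; it is not stated in the report, but it gives a value close to the tabled figure of .205.

```python
import math


def se_diff(sd1, n1, sd2, n2):
    """Standard error of a difference between two independent means.

    Assumed form: sqrt(s1^2 / (n1 - 1) + s2^2 / (n2 - 1)).
    """
    return math.sqrt(sd1 ** 2 / (n1 - 1) + sd2 ** 2 / (n2 - 1))


# Summary values as read from Table 1 (High vs. Low n Affiliation).
mean_high, sd_high, n_high = 0.20, 1.01, 50
mean_low, sd_low, n_low = -0.23, 0.94, 43

se = se_diff(sd_high, n_high, sd_low, n_low)
t = (mean_high - mean_low) / se
print(f"sigma diff = {se:.3f}, t = {t:.2f}")
```

The recomputed t differs from the tabled 2.09 only in the second decimal, which is about what rounding of the tabled means would produce.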

TABLE 1
N AFFILIATION AND THE PERCEPTION OF FACES
(Frequency that quadrant containing face(s) was
reported most salient in terms of Z scores from
neutral and motive-aroused conditions combined)

                    n-Affiliation Score
                    High        Low
Mean z score         .20       -.23
σ                   1.01        .94
N                     50         43
σ diff.             .205
t                   2.09
p                    .02 (one-tailed test)


TABLE 2
PERCEPTION OF FACES AS A FUNCTION OF STRENGTH
OF AFFILIATION MOTIVE AND
EXPERIMENTAL CONDITION
(Frequency that quadrant containing face(s) was
reported most salient, in terms of Z scores from
neutral and motive-aroused conditions
presented separately)

                                  Perceptual Condition
                             Neutral              Aroused
n-Affiliation Score        N   Mean z score     N   Mean z score
High                      16      .14          17      .21
Middle                    11      .15          20     -.01
Low                       13     -.11          16     -.37
σ diff. (High vs. Low)            .38*                 .35*
t                                 .66                 1.67
p                                n.s.                  .05 (one-tailed)

* Derived from estimate of variance within groups = 1.0267
with df = 87.


Our treatment of the data permits us to
compare all Ss who performed the perceptual
task under neutral conditions with all those 
who performed it under aroused conditions 
even though each total group is composed of 
one semester’s whole class and a half of 
another. The results of this analysis may be 
seen in Table 2. Here the original finding is 
seen again in the aroused condition but to a 
smaller and not significant extent in the neu- 
tral condition. This difference between aroused 
and neutral conditions, although large to the 
eye, .58 as compared to .25, is not statistically 
significant (σ diff. = .53). Furthermore, when
this comparison is made with only a High-Low 
break of the n-Affiliation distribution the 
apparent difference is even smaller. 
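The standard errors in Table 2 can be approximately recovered from the pooled within-group variance given in the table footnote (1.0267, df = 87). The sketch below assumes the usual pooled form, sqrt(variance x (1/n1 + 1/n2)). It also assumes a High aroused mean of .21 and a Low aroused N of 16; these two cells are illegible in the source table and are inferred here from the .58 difference quoted in the text and from the total N of 93.

```python
import math

WITHIN_VAR = 1.0267  # pooled within-group variance from the Table 2 footnote


def se_diff_pooled(n1, n2, var=WITHIN_VAR):
    """Standard error of a difference between two means, pooled variance."""
    return math.sqrt(var * (1 / n1 + 1 / n2))


# High and Low thirds in each condition; aroused High mean and Low N are inferred.
cells = {
    "neutral": {"n_high": 16, "mean_high": 0.14, "n_low": 13, "mean_low": -0.11},
    "aroused": {"n_high": 17, "mean_high": 0.21, "n_low": 16, "mean_low": -0.37},
}

results = {}
for cond, c in cells.items():
    diff = c["mean_high"] - c["mean_low"]
    se = se_diff_pooled(c["n_high"], c["n_low"])
    results[cond] = (diff, se, diff / se)
    print(f"{cond}: diff = {diff:.2f}, sigma diff = {se:.2f}, t = {diff / se:.2f}")
```

The neutral-condition figures agree with the table to two decimals. The aroused-condition t comes out slightly below the tabled 1.67, which is consistent with the tabled means having been rounded.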


DISCUSSION 

The clear relationship found between n- 
Affiliation scores and the tendency to select 
faces rather than neutral stimuli in a percep- 
tual task seems to support a relationship 
between motivation and perceptual selection 
of motive-relevant stimuli. Since the measure 
of perceptual sensitivity in this case did not 
involve words, it seems reasonable to reject as 
an explanation any alternative based on verbal 
habits as suggested by Farber (4) or one 
based on frequency of word usage such as that 
offered by Solomon and Howes (8). Indirectly, 
this study lends support to the original interpretation
in terms of motivation and perceptual
selection of some of the studies that
have involved the use of words. For example, 
the McClelland and Liberman study (6) re- 
lating n Achievement to recognition threshold 
for achievement-related words could properly 
be explained on the basis of verbal habits when 
considered alone. However, when they are 
viewed in comparison with the present study 
where words were not a factor in perceptual 
selection, it seems doubtful if an appeal to ver- 
bal habits constitutes a sufficient explanation. 

The clear-cut primary results seem also to 
add to the evidence for the validity of the n 
Affiliation scoring system, since the scoring 
system yields statistically significant results in 
the expected direction on another variable. 

The findings with respect to neutral and 
aroused conditions were not significantly 
different from chance. However, they are in the 
expected direction and the magnitude of the 
difference is approximately what was ex- 
pected in comparison to the primary result. 
That is, there is some relationship in the neu- 
tral condition and more in the aroused condi- 
tion. Since this is the second study in which the 
identical finding has been obtained (see also 
Moulton et al. [1]), it seems likely that the
theoretical expectation (1, p. 90) will eventually
be substantiated on the basis of frequency
of appearance of the finding or in a
study in which there is a very large N. 


SUMMARY 


Thematic apperception measures of n
Affiliation were obtained from 93 male members
of a class in elementary experimental
psychology. Subsequently they were asked to
say which of four figures, flashed on a screen
in the four quadrants up, down, left, and right
of the fixation point, stood out the most or was
clearest. The exposure speed and level of
illumination were such that the stimuli were
below the recognition threshold. On each trial
one of the four stimuli was a face or faces and
the others were similar but affiliation-neutral
stimuli. It was found that those high in n
Affiliation selected faces significantly more
frequently than Ss low in n Affiliation. The
expectation that arousal of the affiliation
motive just prior to participation in the
perceptual task would produce greater
differences between High and Low n Affiliation
Ss was borne out in direction but not to a
statistically significant degree. This study is
interpreted as supporting the predicted
relationship between motivation and the
perceptual selection of motive-relevant stimuli,
and as lending indirect support to this
interpretation of similar findings using
word-recognition thresholds.


CONTRASTING PATTERNS OF FANTASY AND MOTILITY

THE occurrence of differential patterns
and manifestations of psychiatric illness
in various cultures has long been documented
by anthropologists and sociologists
(2, 3, 10, 16, 17). There have been, however, 
relatively few studies employing psychological 
testing techniques to evaluate evidence of 
variations in personality characteristics of 
mental patients within American subcultural 
groups. The present study, part of a broader 
comparison of the psychopathology of two 
such subcultural groups, represents an initial 
effort to test certain hypotheses within a sam- 
ple of schizophrenics stemming from Italian 
and Irish backgrounds. Underlying this study 
is the assumption that certain factors in early
family structures and subsequent cultural
conditionings have led to different modes of
energy distribution and interpersonal orientation
in Americans of Irish and Italian background
and that these divergencies persist
even when serious emotional disturbance is 
in evidence. 

The specific focus of the present investigation
is upon an aspect of personality tentatively
designated as fantasy-motility. A considerable
body of research has suggested that motor and
fantasy behavior may be viewed to some ex- 
tent as functioning vicariously, the inhibition 
of motor activity leading to heightened fantasy 
or motion perception or the characteristic 
resort to fantasy being linked in many individ- 
uals with tendencies or capacities for motor 
inhibition (18, 21). The theoretical significance 
of this dimension stems from its relationship to 
a number of important theories of ego-develop- 
ment and the genesis and function of thought 
in human behavior (4, 8, 21). Much of the 
empirical work in this field has resulted from 
studies with the Rorschach inkblots and there 
is evidence that normals and schizophrenics 
who see relatively more Human Movement 
percepts (M) on the inkblots tend to show less 
overt movement, on the one hand, and to be 
more disposed to imaginative thought, on the 
other (4, 21, 22). To the extent that Ror- 
schach’s M response and its correlates may be 
taken as measures of basic ego functions in- 
volving the capacity to defer gratification and 
to resort to thought as “experimental action” 
(8), their employment in studies of subjects 
with differing cultural backgrounds has prac- 
tical as well as theoretical implications. 

There are suggestions in the literature that 
early identifications with significant family 
figures may be basic to the development of
delaying capacities or fantasy (9, 12, 17, 21, 
23). There is also evidence which points to dif- 
ferences in cognitive attitudes respecting time 
that seem to stem from differing family constel- 
lations in middle- and lower-class normals and 
schizophrenics (11, 13, 15, 20). These socio- 
economic class differences cut across the nor- 
mal-pathological dichotomy and it seems clear 
that, at least in response to projective test 
situations, the adult schizophrenic’s pattern 
of “time perspective” and parental attitudes 
reflects in part cultural influence on family con- 
stellations as well as specific pathogenic child- 
hood identifications. It is reasonable, therefore, 
to anticipate persisting personality differences
among schizophrenic adults who come from
cultural backgrounds in which early family 
structures differ and in which adult values and 
customs continue to stress divergent patterns 
of inhibition or deferment of gratification. 
Within the crude nosology of severe emotional 
disturbance, one may still expect to observe 
characteristic patterns of fantasy and motility 
that reflect cultural origins and that incorpor- 
ate the gross symptomatology of schizophrenia, 
the hallucinations, delusions, the relative flat- 
tening of affect, and thought disorders within 
the context of these culturally-conditioned ego 
functions. 

The selection of Irish- and Italian-Americans 
as subjects for the study was based on accumu- 
lating anthropological and sociological indica- 
tions of marked differences in their cultural 
patterns, family constellation, and male social 
and sexual role positions (1, 2, 3, 5, 16, 17). The 
clinical experience of the authors based on 
intensive study with individuals from these 
cultural groups further amplified the general 
picture as did considerable evidence from the 
existing literature. In the Irish family an oft-observed
central and controlling maternal role
contrasts with that of the Italian mother who,
despite warmth and protectiveness, often
represents a reflection of greater paternal
authority. Sexuality in the two traditions differs
as do attitudes towards their common
religion. In the Italian family sexuality is often 
emphasized as part of the expressive and emo- 
tional life and is accepted as an assertion of 
maleness. Among the Irish, known for their 
mild and protracted courtships, delayed mar- 
riages, and celibate emphases, sexuality is 
clearly subordinated to procreation and, apart 
from this framework or from the marital set- 
ting, is more likely to be regarded as guilt- 
producing and sinful. In brief, the Irish family 
constellation with the relatively powerful 
mother-figure and female siblings tends to 
emphasize inhibition and delay of gratification 
with ambivalence towards the female and difficulty
in establishing an identification with a
father-figure who is psychologically neutral 
and frequently absent from the home. 

In the Italian family, fairly common practice 
permits use of the normative cultural pattern 
for an expressive acting out of feelings. Thus, 
the emphasis is on direct expression of emotions 
resulting from conflicts with the relatively 
powerful father and older male siblings. Such
emotions may tend in their pathological extremes
to be built more readily into hostile
patterns of reaction, often self-destructive as 
well as destructive of the male parental image. 
A previous anthropological survey by Opler of 
milder forms of mental illness among Irish and 
Italians in another hospital had suggested 
that the Irish male schizophrenic may be seen 
as beset with fear and guilt, controlling deeply 
repressed hostility towards female images 
chiefly through repression and fantasy. The 
Italian male model for schizophrenia seemed, 
on the contrary, given to more destructive and 
overtly hostile impulses, aimed chiefly at the 
male parental image but built up to uncon- 
trollable dimensions in accordance with a cul- 
tural value of emotional expression at all costs. 

In proceeding toward an operational formu- 
lation of hypotheses derived from the cultural 
differences between the Irish and Italian, it 
was decided to employ a battery of evaluative 
procedures with fairly well known properties. 
Rorschach’s Human Movement response (M)
seemed a logical variable for study as well as 
the Thematic Apperception Test Transcend- 
ence Index (22), both measures of fantasy 
disposition. A time-estimation task involving 
delaying ability for accuracy as well as a slow- 
writing task were included as measures of in- 
hibition. Similarly, the Porteus Mazes, involv- 
ing as they do a combination of motor restraint 
and foresight for successful performance, were 
included in the battery. The resort to fantasy
and greater obsessional tendencies of the Irish
might be expected to lead to more ready admission
of disturbance following failure in a
frustration-tolerance situation. Finally, in
order to obtain something approximating a
longitudinal section of behavior in the motor
area, ratings of ward behavior with respect to
cooperativeness and aggression or assaultive- 
ness were employed (14). 


HYPOTHESES 


1. Irish schizophrenic patients differ from
Italian schizophrenics by showing greater
evidence of fantasy-tendency. Specifically,
they show more Human Movement responses
on the Rorschach inkblots, lower threshold for
Human Movement on Barron’s inkblots, and
higher scores in Transcendence on Thematic
Apperception Test stories.

2. Irish patients show greater evidence of
inhibition, motor inhibition, and restraint on
tasks calling for delaying capacity. They require
longer times in a slow-writing motor
inhibition task, score higher Test Quotients on 
the Porteus Mazes, and achieve greater accu- 
racy in Time Estimation in a situation 
involving delay in response. 

3. Irish patients more readily admit to 
disturbance following failure in a frustration- 
tolerance test. 

4. In ratings by personnel of their ward 
behavior, Irish patients are less aggressive, 
less overtly assaultive, and more coopera- 
tive in ward routines than are Italian 
schizophrenics. 


METHOD 
Subjects 


A total of 60 male veteran patients diagnosed as 
schizophrenic were chosen from the wards of the hospital
so as to constitute two samples of thirty patients
each from Irish-American and Italian-American cultural
backgrounds. Criteria for inclusion of patients
involved (a) use only of clearly diagnosed schizophrenics
without known organic brain damage, (b)
first to third generation from both ethnic groups with 
residence in the New York City metropolitan area, 
(c) ages 18-45 inclusive, (d) patients sufficiently in 
contact to undergo the test battery and anthropological 
interview. Table 1 presents pertinent data comparing 
the two samples for age, education, date of first hos- 
pitalization, and Wechsler-Bellevue IQ. An evaluation 
of socioeconomic status was also carried out, using as 
indices parents’ education and occupation, income, and 
housing as well as Ss’ standing on these variables. 
Socioeconomic status and religion (all Ss were pro- 
fessed Roman Catholics) did not differ for the two 
samples. Within the Irish group, all families originated 
in the poorer Southwest counties while the Italian Ss 
with two exceptions were of South Italian or Sicilian 
origin. Generation level was approximately equated 
in each group, thus limiting differential effects of 
acculturation, a factor of considerable significance for 
motor patterns in Efron’s work (7). 


Procedure 


All Ss received an assessment battery administered 
in standard fashion. The following is a brief descrip- 
tion of the procedures and the measures employed to 
test experimental hypotheses: 

Rorschach Inkblots. All Rorschachs were scored by 
Klopfer’s method. The primary variable derived from 
the Rorschach was the Human Movement response 
(M). Only frequency of occurrence was considered in 
this study. Statistical analysis of other Rorschach 
data was limited to those variables in which a sufficient 
range of scores emerged. 

Barron’s Movement-Threshold Inkblots. These blots 
were included in order to evaluate their suitability as 
a measure of Rorschach M more easily obtained and 
more suitable for quantification. Barron’s inkblot
series (4) consists of a series of black-and-white inkblots
arranged in a sequence which elicits increasing
numbers of M responses over the range of 30 cards. 
The score employed was the card number on which 
the initial Human Movement response was offered by S. 

Thematic Apperception Test. Four cards, 1, 6, 7, 
and 12, of the Murray series were administered. 
Weisskopf’s Transcendence Index as employed in an 
earlier study (22) was applied to the content of the 
stories. This index, a measure of the extent to which 
S’s story introduces characters, activities, and emo- 
tions not actually present on the stimulus card, may 
be thought of as a reasonable operational definition of 
fantasy disposition. The actual score employed was 
the total number of transcendent items over the four 
cards. 

Porteus Mazes. This instrument was administered 
in standard fashion. Test Quotients were computed 
in accordance with norms available from Porteus (19). 
Qualitative notes on mode of approach were kept. 

Time Estimation. As a fairly direct measure of 
delaying capacity, the Ss were asked to indicate when 
they thought a given time interval had elapsed. Three 
times were used, 15, 30, and 60 seconds. The E started 
a stop watch and recorded the actual time passed 
when S signaled that he thought the stated interval 
had passed. The score was the sum of the three estimations
subtracted from absolute times. Since almost
all Ss indicated that the time had passed before it 
actually had, accuracy and delaying ability were prac- 
tically identical. 
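The Time Estimation score can be illustrated with a short computation. The sketch assumes the score is the sum, over the three intervals, of the stated time minus the time actually elapsed when S signaled, so that a larger positive score means S signaled earlier. The function name and the sample values are hypothetical.

```python
TARGETS = (15, 30, 60)  # stated intervals in seconds, as given in the text


def time_estimation_score(elapsed):
    """Sum of (stated interval - elapsed time at S's signal) over the three trials.

    A positive score means S tended to signal before the interval had passed.
    """
    if len(elapsed) != len(TARGETS):
        raise ValueError("expected one elapsed time per stated interval")
    return sum(target - actual for target, actual in zip(TARGETS, elapsed))


# Hypothetical S who signals early on every trial.
print(time_estimation_score((9.5, 21.0, 40.5)))  # → 34.0
```

Under this reading, a lower score reflects both greater accuracy and longer delay, which is why the two are described as practically identical when nearly all Ss signal too soon.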

Motor Inhibition. This task, adapted from Downey’s 
Will-Temperament scale, required Ss to write a simple
phrase as slowly as possible without stopping the 
motion of their pencils. Time in seconds taken to 
write the phrase was the basic score. 

Admission-Denial of Frustration. This variable was 
derived from response to a simple experimentally in- 
duced failure experience devised by Wilensky (24). 
Following a period of nine successive failures on digit 
series, Ss almost invariably showed impairment in 
retention of digit series previously within their scope. 
Despite this apparent evidence of disturbed perform- 
ance following failure, many Ss denied conscious dis- 
tress upon direct inquiry. In this study each S was 
asked if he had been disturbed by his failures. Re- 
sponses were categorized as admissions or denials of 
disturbance following failure. Some cases were lost in 
this technique because of bizarre verbalization or 
refusals to comment. 

Behavior Ratings. To obtain some evaluation of a
longitudinal nature, two groups of items from the 
Multidimensional Rating Scales for Ward Behavior of 
Psychiatric Patients (14) were employed. These items, 
grouped into scales for Aggression and Cooperation, 
were rated by a staff clinical psychologist during inter- 
views with ward personnel, aides, nurses, and physi- 
cians, all of whom had observed the patients for ap- 
proximately six months on their wards. The emphasis 
in these scales is on actual incidents and overt behavior 
rather than on interpretations of underlying trends. 
Items on the Aggression Scale included instances of 
overt destructive behavior, assaultiveness, and verbal 
outbursts, while items on the Cooperation Scale involved
such factors as willingness to follow ward therapeutic
routines, assisting in ward chores, and accepting
authority of ward personnel. As employed here, high
scores on these scales imply considerable aggressive
behavior and minimal cooperation.


TABLE 1
MEANS OF IRISH AND ITALIAN SCHIZOPHRENICS ON
PERTINENT BACKGROUND FACTORS, RORSCHACH
DETERMINANTS, AND EXPERIMENTAL VARIABLES

Variables                                 Irish     Italian

Background Factors
  Age
  Educational Grade Level
  Year of First Hospitalization
  Wechsler-Bellevue IQ
Rorschach Variables
  Total Responses (R)
  Human Movement (M)
  Animal Movement (FM)
  Form-Color (FC)
  Color-Form (CF)
  Weighted Color Sum (Sum C)
Experimental Variables
  Movement-Threshold Inkblots 3
  TAT Transcendence Index Si 4.
  2
  8
  Porteus Mazes TQ 86.
  Time Estimation (dev. in secs. from absolute times) 44.
  Motor Inhibition Time (secs.)           147.0     106.5
  Admit Frustration                       67%       42%
  Aggressive Ward Behavior Rating         1.60      2.30
  Cooperation Behavior Rating             1.87      0.67


RESULTS 


Table 1 presents a comparison of Irish and 
Italian group means for background variables, 
Rorschach determinants, and experimental 
variables. The groups do not differ significantly 
in age, years of education, date of first hospital- 
ization (an estimate of chronicity), Wechsler- 
Bellevue IQ, or in the representation from 
various socioeconomic levels. Evaluation of 
significance levels was carried out by dichot- 
omizing Irish and Italian distributions for all 
variables at the medians and employing chi 
square to evaluate the association of cultural 
background with high or low scores on a given 
measure. Following Edwards (6), rφ was calculated
between Irish-Italian on each measure
as a rough estimate of extent of association. 
A positive correlation indicates higher scores 
for Irish Ss on the particular variable. Results 
of these analyses are presented in Table 2. 
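The median dichotomy and chi-square analysis described above can be sketched as follows. Several points are assumptions rather than details given in the report: the split is taken at the median of the two groups combined, no continuity correction is applied, scores exactly at the median count as low, and the r coefficient attributed to Edwards is taken to be the phi coefficient for a fourfold table. The function name and the data are hypothetical.

```python
import math
from statistics import median


def median_test(irish, italian):
    """2 x 2 chi-square and signed phi for a split at the combined median.

    Returns (chi_square, phi). Phi is positive when Irish scores fall
    above the median more often than Italian scores do.
    """
    med = median(list(irish) + list(italian))
    a = sum(x > med for x in irish)    # Irish above the median
    b = len(irish) - a                 # Irish at or below the median
    c = sum(x > med for x in italian)  # Italian above the median
    d = len(italian) - c               # Italian at or below the median
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        raise ValueError("a marginal total is zero; chi-square is undefined")
    chi_square = n * (a * d - b * c) ** 2 / denom
    phi = (a * d - b * c) / math.sqrt(denom)
    return chi_square, phi


# Hypothetical scores on one measure for six Ss per group.
chi_square, phi = median_test([4, 5, 6, 7, 8, 9], [1, 2, 3, 4, 5, 6])
print(f"chi-square = {chi_square:.2f}, phi = {phi:.2f}")
```

With one degree of freedom, chi-square equals N times phi squared, so the two statistics carry the same information; phi simply adds the direction of the difference.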

TABLE 2
VALUES OF χ² AND rφ FOR DICHOTOMIZED DISTRIBUTIONS
OF IRISH AND ITALIAN GROUPS ON PERTINENT
BACKGROUND, RORSCHACH, AND
EXPERIMENTAL VARIABLES

Variable                                  χ²        rφ

Age
Education
Year of First Hospitalization
Wechsler-Bellevue IQ
Rorschach Responses
Rorschach M
Movement-threshold
TAT Transcendence Index
Porteus Mazes
Time Estimation
Motor Inhibition Time
Admission-Denial of Frustration
Aggression Behavior Rating
Cooperation Behavior Rating

* Significant at p/2 < .05 > .01
** Significant at p/2 < .01


Inspection of the tables suggests consistent
support for the hypotheses. The Irish and
Italian Ss do not differ in frequency of the
Rorschach determinants except for the M re- 
sponses, which are definitely more numerous 
among the Irish. The Irish patients, as hy- 
pothesized, show lower thresholds for Human 
Movement responses in Barron’s inkblots, 
greater transcendence in their TAT stories, 
lower deviation from absolute times in Time 
Estimation, longer inhibition times in the slow- 
writing task, greater tendency to admit dis- 
tress following failure, and, in their ward 
behavior, less aggression and greater coopera- 
tiveness. Only the Porteus Maze scores reveal 
no differences and fail to support the relevant 
hypothesis. A qualitative difference in perform- 
ance on the mazes was evident, however. While 
both groups obtained similar scores, the Irish, 
according to qualitative notes, were far more 
often reported to be deliberate and cautious in 
performance, whereas the Italian Ss were more 
often described as impulsive and rapid. Why 
this difference did not affect ultimate outcome 
in maze score could not be ascertained. 

In connection with the ward ratings, it 
should be noted that ward personnel who 
provided the data and the raters were ignorant 
of the purposes of the experiment. Similarly, 
the difference between the groups in ward co- 
operation could not account for test score dif- 
ferences since, in the actual testing situation, 
the Italian Ss proved to be somewhat more 
genial and amenable to examination.


DISCUSSION

There seems little question that the test 
performance of these two groups reflects clearly 
a difference in fantasy and motility that is 
strikingly in accord with anticipations based on 
anthropological evaluations of cultural frame- 
works and family constellations. When it is 
kept in mind that both the Italian and Irish 
patients are American citizens, most of them 
native born, that they have participated in 
the same urban culture and have also under- 
gone the standardization of military service, 
then the persistence of differences rooted in 
subcultural group participation is highly note- 
worthy. When it is further recognized that 
both groups of patients were grossly psychotic, 
the emergence of the differential fantasy-motility
pattern is indeed a remarkable demonstration
of the hypothesized group differences.
The patients in this study all showed
many symptoms of schizophrenia, although
such symptoms as “flattening of affect” were
less apparent in the Italian and “catatonic
excitements” were not characteristic of the
Irish. At the same time, despite the obvious
fact that thinking disorders and hallucinations 
were present in most cases, wide variations in 
personality structure emerged. As James S. 
Plant, Fromm-Reichmann, and Horney have
suggested, however, these data and other 
materials from the same larger study (18) sug- 
gest that schizophrenias and other personality 
disorders highlight culturally engendered con- 
flicts. This view poses serious questions about 
the validity and usefulness of current psychi- 
atric nomenclature (16). Should study with 
further samples confirm the findings of this 
report, extensive familiarity with subcultural 
patterns may be increasingly essential for effec- 
tive psychiatric diagnosis and personality 
evaluation. The need for further work using 
test batteries like the one reported here with 
normal Ss from various subcultural groups is 
obvious. 

Pending further exploration it seems reason- 
able to conclude that schizophrenics stemming 
from Irish and Italian cultural backgrounds do 
show markedly different patterns of fantasy 
and motility. The results seem best interpreted 
as suggesting the possibility that because of 
the limitation on variability in family constel- 
lation imposed by cultural background, as well 
as the recurrent influences of cultural values
during development, characteristic differences
in modal personalities (9) from different sub- 
cultural groups persist through adulthood and 
are in evidence even during a period of severe 
emotional disturbance. To the extent that such 
modal personality patterns may be derived 
from differing family constellations and cul- 
tural values, an extensive exploration of fan- 
tasy and motor activity as well as of other 
relevant personality dimensions by means of 
joint psychological testing and anthropo- 
logical survey with many other subcultural 
groups appears to be desirable. Such explora- 
tions might go beyond traditional cultural 
stereotypes toward an understanding of vari- 
ables underlying the relationships between 
patterns in family life and cultural standards 
as they interact to mold the adult personality 
in its normal and pathological manifestations. 


SUMMARY 


This study represented a portion of a larger 
investigation involving a comparison of the 
psychological test performance and psycho- 
pathological manifestations of schizophrenic 
subjects from different American subcultural 
groups. Based on anthropological evaluation of 
cultural patterns and child-rearing practices 
in families of Irish and Italian ethnic back- 
ground, certain hypotheses concerning differ- 
ences in fantasy and motor activity were 
developed. The Irish were expected to prove 
more given to imaginative behavior and motor 
control when compared with Italian Ss. When 
comparable samples of Irish and Italian schiz- 
ophrenic males were studied by psychological 
tests and ward behavior ratings, these hy- 
potheses were essentially confirmed. In the 
test results, no differences emerged in Ror- 
schach variables except for the Human Move- 
ment responses, more of which were produced 
by the Irish. The Irish showed lower threshold 
for perception of Human Movement on Bar- 
ron’s inkblots, greater transcendence of stimu- 
lus content in their TAT stories, longer motor 
inhibition times on a slow-writing task, longer 
delay in time estimation, and greater tendency to 
admit distress following frustration. Ratings 
of ward behavior indicated the Irish as more 
cooperative in therapeutic routines and less 
overtly aggressive than the Italian patients. No 
differences in Porteus Maze Test Quotients 
emerged, although a qualitative tendency for 
the Irish to work more slowly and hesitantly 
compared with the impulsivity and haste of 
the Italian Ss was clearly apparent. In general, 
the results appear to support the hypothesis 
of persistent differences in motor and fantasy 
activity between patients from the two ethnic 
groups. Implications for review of psychiatric 
nosology, personality research, and interdisciplinary 
collaboration are manifold. 
THE EFFECT OF GOAL VALUE UPON EXPECTANCY¹ 


LEONARD WORELL 
Reed College 


BEHAVIORAL expectancies do not always 
appear to be governed by “realistic” 
factors in a situation, but seem also to 
be a function of wishes, fears, and doubts. Ob- 
servers disagree, however, about the manner in 
which differentially valued goals affect expect- 
ancies. Some believe, for example, that expect- 
ancies are lowered, in the face of highly valued 
goals, as evident in the student before an im- 
portant examination, while others maintain 
that they are raised, as in gambling situations. 
The present study attempts to examine some 
of the conditions under which expectancies may 
or may not be related to different levels of goal 
value. 

This research was conceived within the 
framework of a social learning theory of per- 
sonality developed by Rotter (11). Within this 
system predictions of behavior are mediated 
by two major constructs, expectancy and re- 
inforcement value. Expectancy is defined as 
“the subjective probability held by the indi- 
vidual that a particular reinforcement will 
occur as a function of, or in relation to, a spe- 
cific behavior in a given situation or situations” 
(11, p. 112). Expectancies are viewed as being 
determined by experience in particular situa- 
tions and in related situations. Thus, a student 
taking a history course may have expectancies 
that studying will lead to a certain grade, based 
upon experiences in this history course, other 
history courses, other academic courses in 
general, and so on. The second construct, re- 
inforcement value, is defined as “the degree of 
preference for any reinforcement to occur if 
the possibilities of their occurring were all 
equal” (11, p. 112). This value may be either 

1 Adapted from a dissertation submitted in partial 
fulfillment of the requirements for the degree of Doctor 
of Philosophy in the Department of Psychology, The 
Ohio State University. The writer wishes to express 
his appreciation to Professors Julian B. Rotter, Delos 
D. Wickens, Lauren G. Wispé and Robert J. Wherry 
for their helpful suggestions and advice. The writer is 
indebted to Russell Slater, director of the Child Study 
Division, and to the principals and teachers of the 
participating schools in the Columbus Public School 
System for their cooperation in making subjects avail- 
able. 
positive or negative. The terms goal value and 
reinforcement value are used interchangeably 
hereafter. 

From a social learning point of view, there 
is no necessary relationship between expectancy 
and goal value. Under certain conditions a 
specific relationship may be anticipated, while 
under others the nature of the relationship is 
obscure. The purpose of this study is twofold: 
to pursue an empirical phenomenon of general 
interest and to contribute to the predictive 
power of Rotter’s theory. 

In examining the relationship of goal value 
to expectancy, previous investigators have con- 
sistently used mean expectancy scores derived 
from a series of trials. For this type of situation 
the nature of the relationship between the con- 
structs cannot be logically deduced from Rot- 
ter’s theory. From the standpoint of other 
systematic points of view and general observa- 
tions, conflicting conceptions emerge. Thus, in 
Lewin’s formulations (8), as the difficulty level 
of a situation increases the value of success 
increases, while the subjective probability of 
success attainment decreases. The analysis of 
individuals confronted with high value situa- 
tions seems to bear this out, for low expect- 
ancies are generally elicited. The suggestion 
follows that confidence in attaining goals de- 
creases as they become more important, pos- 
sibly because the acquisition of high goals in 
the past has involved considerable effort and 
competence. On the other hand, it seems 
equally reasonable to suppose that expectancies 
increase in highly valued situations. Our society 
might be seen as providing a particularly strik- 
ing example, in that both striving for high 
goals and displaying confidence in one’s ability 
to achieve them are held as the ideal. In view 
of these alternatives, the exact nature of the 
relationship between goal value and expectancy 
over a series of experiences in the same situa- 
tion must be a matter for empirical de- 
termination. 

On the basis of Rotter’s theory, at least two 
conditions should lead to the absence of any 
observable relationship between the constructs. 
One of these hinges on an assumption of social 
learning theory that expectancies are a function 
of experiences in both a particular situation and 
in related situations. Expectancies in a rela- 
tively novel situation should thus be deter- 
mined by experiences from related situations. 
Such related experiences may include the occur- 
rence (or nonoccurrence) of past reinforcements 
and, possibly, the values of these reinforce- 
ments. With additional experience in a situa- 
tion, expectancies should increasingly be 
determined by the actual occurrence of rein- 
forcements in that situation (2, 4). Although 
the nature of the influence of different goal 
values upon expectancies cannot be specified 
in a novel situation, after experience with that 
situation expectancies should therefore be 
determined only by the occurrence and not by 
the value of the reinforcements. 

The second condition in which differential 
goal values should produce little or no effect 
upon expectancies deals with the introduction 
of a penalty for inaccuracy in stating expect- 
ancies. Relatively widespread use has been 
made of penalties in level-of-aspiration experi- 
ments. According to social learning theory, a 
penalty may be viewed as a reinforcement value 
associated with accuracy. This additional re- 
inforcement value (often interpreted as a nega- 
tive reinforcement by the S) should counteract 
the tendency to make cautious or wishful ex- 
pectancy statements engendered by high or low 
goal values. If the severity of the penalty is 
identical for different goal value conditions, 
therefore, expectancies should be uniform in 
relation to these goal values. In addition, 
expectancies should be more realistic or per- 
formance-oriented than in nonpenalty con- 
ditions. 

In summary, three aspects of the relationship 
between goal values and expectancies are under 
investigation. The question is first asked 
whether (a) differential goal values affect ex- 
pectancies over a series of experiences in the 
same situation. The remaining two aspects 
involve attempts to support extensions of 
Rotter’s social learning theory. Thus, (b) the 
relationship, if any, observed in (a) is hypothe- 
sized to be a function of the amount of experi- 
ence in the situation. Relatively novel 
situations may elicit a relationship between 
expectancies and differential goal values, but 
experience in these situations should produce 
uniform expectancy statements. Finally, (c) it 
is hypothesized that the introduction of a re- 
inforcement value for accuracy leads to more 
realistic expectancies, on the one hand, and 
expectancies that are unaffected by the value 
of an event, on the other. 


METHOD 
Selection of Tasks 


A number of requirements had to be met in the 
selection of tasks. First, the tasks had to be relatively 
novel, so that expectancies would not be established 
pre-experimentally. Second, they had to permit E to 
control performance, in order that the reinforcements 
would be uniform for all Ss under all value conditions. 
Finally, they needed face validity for the goal area 
involved, i.e., athletic skills. From a pilot study, three 
tasks were chosen, each one representing a different 
goal-value level. These were (a) the Rotter level-of- 
aspiration board (10) modified by a cover placed over 
the numbers so that S was unable to see his perform- 
ance; (b) a rotary pursuit test (score measured by 
amount of time that the stylus is kept on revolving 
disc); and (c) a set of six blocks for a tapping test. 
The results of the pilot study indicated that the tasks 
were perceived as having differential goal values. 
Through a ranking procedure, it was found that the 
level-of-aspiration board and the rotary pursuit task 
were more highly valued than the tapping task. These 
initial preferences were coupled with instructions (see 
below) that assigned more or less value to each task. 


Subjects 


A total of 102 boys, ranging in age from nine to 
eleven, was selected from the fifth and sixth grades of 
the local public school system. Eight Ss who were 
unable to comply with the instructions were discarded. 


Administration of Tasks 


Each S was seen individually. He was seated at a 
table on which the three tasks were arrayed. The S 
was given a description of each task in terms of its 
relative merit in providing information about a per- 
son’s athletic abilities. Three levels of goal value were 
employed, designated as high, medium, and low. For 
the high reinforcement-value task (level-of-aspiration 
board), S was told that this was the most important 
task, that it provided the most information about 
skills required in all sports, e.g., coordination, timing, 
and balance, and that this task was of greatest con- 
cern to E. The medium value task (rotary pursuit) was 
said to be not quite as useful and to provide less 
information about a person’s abilities. Finally, the 
low value task (tapping) was described as much infe- 
rior to the others and of almost no interest to E. When 
tapping was followed by one of the other tasks, S was 
also instructed to “get this one over quickly, so that 
we can get on to the others.” 

A level-of-aspiration procedure was used in obtain- 
ing expectancies. The S was informed of the range of 
possible scores (0 to 50) and was asked to provide an 
estimate of his subsequent performance for each trial. 

TABLE 1 
MEAN D SCORES AND STANDARD DEVIATIONS FOR ALL GROUPS 

                         Nonpenalty Group                  Penalty Group 
               Six Trials     1st Trial     6th Trial       Six Trials 
Goal Value      M     SD      M     SD      M     SD         M     SD 
High           5.5   16.2     .9   4.?      .3   2.8       -3.7   14.6 
Medium        11.6   17.1    3.8   4.9      .7   2.7        -.5   15.7 
Low           10.6   14.6    3.0   4.5     1.0   3.0        2.8   14.6 


TABLE 2 
MEAN DIFFERENCES AT EACH COMBINATION OF REINFORCEMENT VALUE 

                        Reinforcement Value Conditions 
                    High and Medium      High and Low       Medium and Low 
Group          N    Diff.    t     p     Diff.    t     p    Diff.    t     p 
Nonpenalty     71    6.1   3.82  .001     5.1   3.53  .001    1.0    .54   NS 
  1st Trial    71    2.9   4.10  .001     2.1   2.91    ?      .8   1.27   NS 
  6th Trial    71     .4   1.3   .20       .7   2.33  .0?      .3    .28   NS 
Penalty        23    3.2   1.14   NS      6.5   3.01 <.01     3.3   1.92  .10 


This was followed over the series of six trials. Tasks 
were rotated, so that Ss performed differentially valued 
tasks in varying orders. A score of either 26, 27, or 28 
was given S on each task immediately after his first 
performance. Following this, a predetermined series of 
scores was supplied which provided the same mean 
increase in performance over the six trials for each of 
the three tasks. 

Seventy-one Ss performed the tasks without a 
penalty for inaccuracy of estimates. Another 23 Ss 
performed the same tasks with a penalty being estab- 
lished for inaccuracy by instructing S that he would 
lose twice the number of points that his expectancy 
fell below his actual performance, and that he would 
obtain exactly what he expected if his actual score 
were at or above his expectancy estimate. Each S was 
given detailed examples and was not permitted to 
begin the tasks until he could calculate his score on 
an example without assistance. 


RESULTS 


Analysis of data was based on the stated ex- 
pectancies to the three tasks of high, medium, 
and low goal value. A D score was employed, 
the difference between the expected score and 
the preceding performance. Tests of the hy- 
potheses, therefore, are focused on the differ- 
ences between mean D scores (Table 1) for each 
of the reinforcement-value conditions. 
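
The D score defined above can be illustrated with a minimal sketch. The function name `d_scores` and the sample numbers are hypothetical and are not taken from the study's data; the sketch assumes that each stated expectancy is paired with the score reported on the immediately preceding trial.

```python
from statistics import mean


def d_scores(expectancies, preceding_scores):
    """Return D = stated expectancy - preceding performance, one per trial."""
    if len(expectancies) != len(preceding_scores):
        raise ValueError("each expectancy needs one preceding score")
    return [e - p for e, p in zip(expectancies, preceding_scores)]


# Hypothetical values for one S on one task (not data from the study).
stated = [30, 31, 33]
previous = [27, 29, 32]
ds = d_scores(stated, previous)
# A positive mean D means expectancies run above prior performance;
# a mean D near zero corresponds to "realistic" expectancies.
mean_d = mean(ds)
```

Under this reading, a lower mean D for one task than for another indicates more cautious expectancy statements on that task.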

Effects of goal value. Bartlett’s test of homo- 
geneity of variance was applied before perform- 
ing an analysis of variance and the assumption 
of homogeneity was supported. The analysis of 
variance indicated that the difference attribut- 
able to reinforcement-value conditions is highly 
significant (p < .001). Two-tailed t tests for 
correlated means were used in comparing mean 
D scores. Two of the three comparisons are 
significant (Table 2); the mean D score at the 
high value is significantly lower than either the 
medium or low values. 
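
As a rough illustration of a t test for correlated means, the statistic can be computed from the per-subject differences between two conditions. This is a sketch under stated assumptions: the function name `paired_t` and the sample values are hypothetical, and the code shows the standard paired-difference formula rather than the author's actual computations.

```python
from math import sqrt
from statistics import mean, stdev


def paired_t(x, y):
    """Return (t, df) for correlated means, computed on pairwise differences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length samples with at least two pairs")
    diffs = [a - b for a, b in zip(x, y)]
    # Standard error of the mean difference (sample SD with n - 1).
    se = stdev(diffs) / sqrt(len(diffs))
    if se == 0:
        raise ValueError("differences have zero variance")
    return mean(diffs) / se, len(diffs) - 1


# Hypothetical mean D scores for four Ss under two value conditions.
medium_value = [12, 9, 15, 10]
high_value = [6, 5, 8, 7]
t_stat, df = paired_t(medium_value, high_value)
```

Because every S performed all three tasks, the comparison is made within subjects, which is why the paired form rather than an independent-groups test applies.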

Effects of experience. As six trials were used, 
differences between expectancies might be 
anticipated at the first trial, while none should 
appear at the sixth trial. Analyses of variance 
indicated that the reinforcement-value condi- 
tions continued to provide significant sources 
of variance at both trials (p < .001 at Trial 1, 
p < .05 at Trial 6). When mean D scores are 
compared at Trial 1 (Table 2), the differences 
between the high and medium values and the 
high and low values are highly significant. At 
the sixth trial, a significant difference is found 
only between the high and low value conditions. 

Effects of penalty. Table 1 contains the mean 
D scores for groups with and without a penalty 
associated with expectancy statements. Inspec- 
tion of this table confirms the hypothesis that 
a penalty leads to more realistic expectancies. 
“Realistic” is defined in terms of the mean D 
scores’ proximity to zero, or stated in another 
way, the extent that expectancy statements 
approach previous performance. The differences 
between penalty and nonpenalty groups are 
significant in all cases, falling at less than the 
.02, .01, and .05 levels at the high, medium, 
and low values respectively. 

Our interest here, however, lies also in the 
differences between mean D scores at the three 
value conditions of the penalty group alone. As 
before, the analysis of variance continued to 
show that a significant source of variance may 
be attributed to the reinforcement-value con- 
ditions (p < .05). In applying two-tailed t tests 
for correlated means (Table 2) to the differences 
between mean D scores, the only significant 
difference is found between high and low condi- 
tions, although the difference between medium 
and low conditions approaches significance. 
When the differences between means in the 
penalty group are examined more closely, it 
appears that the effect of a uniform penalty at 
all value conditions is to produce more pro- 
nounced separations between high, medium, 
and low goal values. 
DISCUSSION 

The most general statement that may be 
drawn from the results is that the value of a 
goal has an effect upon the level of stated ex- 
pectancy. It was found that (a) the presence of 
high goal values leads to significantly lower 
expectancy estimates over a series of experi- 
ences in the same situation than do either 
medium or low value conditions; (b) the amount 
of experience, though appearing to dilute some- 
what the effects of goal values upon expectancy, 
does not eliminate the tendency for high goals 
to be associated with lower expectancies in the 
six trials of the experiment; and (c) with the 
inclusion of a penalty (reinforcement value) for 
inaccuracy, contrary to what was anticipated, 
expectancies continued to be lower in relation 
to highly valued goals. A penalty did, however, 
lead to more realistic or performance-oriented 
expectancy estimates. 

The influence of reinforcement value upon 
expectancy was not demonstrated at all levels 
of goal value. That expectancies were relatively 
lower for high values suggests that the fre- 
quently quoted expression “the good things 
are always hard to get” may well be applied 
here and that individuals appear to perceive 
high goals as difficult in the Lewinian sense. 
The failure to obtain significant differences be- 
tween medium and low value conditions, how- 
ever, requires explanation. A possible factor 
may have been the relative absence of perceived 
separation between the experimental goal-value 
conditions. The division between the three 
goal values was made in accordance with an 
expected theoretical and experimental gradient. 
It is not known, however, how distinct the 
goals were or how important the distinction 
seemed to the Ss. A completely satisfactory 
method for measuring precise distances be- 
tween goal values has yet to be developed. 

Goal values appeared to exert their strongest 
influence upon expectancies in relatively novel 
situations. This finding suggests that different 
initial expectancies are held in relation to vary- 
ing goal levels from previous related situations. 
People seem to differentiate relatively novel 
situations not only in accordance with the pos- 
sible occurrence (or nonoccurrence) of rein- 
forcements, but also with regard to the values 
of the reinforcements. The behavior of the Ss 
in this study suggests that individuals may 
ordinarily respond in terms of two steps along 
this continuum, e.g., high and low value. The 
presence of a penalty seemed to be a condition 
leading to more subtle discriminations between 
values. 

Reference has already been made to the fact 
that the effects of goal value on expectancy did 
not disappear with experience. There was, how- 
ever, a tendency for the influence of goal values 
to be diminished. This observation derives 
further support from a comparison of the mean 
changes that are found at each combination of 
reinforcement value between Trials 1 and 6. 
When the t test² is applied to these mean 
changes, the differences between trials for the 
high and medium (t = 2.8, p < .005) and the 
high and low (t = 1.9, p < .03) are highly sig- 
nificant, while that for the medium and low 
values approaches significance (t = 1.6, p = 
.06). This finding is interpreted to mean that 
differences between expectancies have been re- 
duced at the sixth trial and that a more ade- 
quate test of the effects of experience would be 
made by extending the number of trials beyond 
six. 

In securing a particular relationship between 
goal value and expectancy, this study may 
seem merely to have added another conflicting 
result to an array of studies already in disagree- 
ment (1, 3, 5, 7, 9). Representative of one kind 
of previous approach to the problem is a study 
by Marks (9), which sought to determine the 
effect of desirability of outcome on the stated expecta- 
tions of children in certain choice situations. 
Using what may be referred to as a quasi- 
gambling technique, she found that the value 
of an event exerted a strong effect upon stated 
expectations. Contrasted with this, Holt (5), 
employing what may be called a competitive 
level-of-aspiration technique, demonstrated no 
effect of “ego-involvement” on the stated ex- 
pectancies of his college Ss. The present study 
adds to the picture of inconsistency. 

A possible resolution may be achieved by 
differentiating the kinds of situations studied. 
At least two broad divisions of situations may 
be distinguished: achievement and nonachieve- 
ment. In the former, performance is dependent 
upon ability or skill, and thus reflects upon an 
individual’s competence. In the latter, which 
may include gambling, some play and simple 


2 Since these were anticipated outcomes of our hy- 
pothesis regarding effects of experience, one-tailed 
tests of the distribution of t were employed. 
choice situations, no ability or skill is promi- 
nently involved, and an individual’s competence 
is not challenged. One might anticipate differing 
results of the effects of goal value upon expect- 
ancy under these two conditions, since the 
achievement situation introduces additional 
goal values. 

This distinction appears to assist in resolving 
disparities in the results of studies such as that 
of Marks and the present one. Her results 
strongly indicated that increased value led to 
increased expectations. According to the pres- 
ent analysis, her study would fall within the 
nonachievement category, since children were 
expressing what they wanted to happen; no 
ability or skill was implied. In this sense, her 
Ss were faced with a form of gambling, and 
culturally, one may find a greater tendency to 
take risks in these situations. In this investiga- 
tion, performance was related to ability, so 
that an additional goal value was introduced. 
Expectancies held in relation to previous 
achievement situations would therefore be most 
relevant. The results of this experiment dem- 
onstrated that expectancies decreased with 
increasing value in relatively novel situations. 
Whether this proposal has any merit aside 
from providing post hoc explanation will be de- 
termined by future experimentation. 

The findings were somewhat equivocal with 
respect to the penalty for inaccuracy of expect- 
ancy statements. Though expectancies were 
more accurate, they did not appear to be re- 
moved from the effects of goal values. Before 
conclusions can be drawn, it would seem neces- 
sary to investigate the problem of penalties 
more thoroughly. Different severities of penal- 
ties and the effectiveness of instructions in 
communicating penalties could well bear 
further examination. 


SUMMARY 


In Rotter’s social learning theory of person- 
ality, predictions of behavior are made through 
the operation of two central constructs, 
expectancy and reinforcement value. The 
present study was designed to investigate the 
effect of different goal (reinforcement) values 
upon expectancy. 

Three levels of goal value were employed in 
a study of fifth- and sixth-grade boys. Expect- 
ancies developed from related situations were 
controlled by having each S perform each of 
three relatively novel tasks. In addition, the 
performance of S was controlled by giving pre- 
arranged sequences of scores. Thus, with both 
performance and experience from related situa- 
tions controlled, the only factor varied was the 
value of each task. 

The most consistent finding was that the 
value of an event has some effect upon stated 
expectancy. Further results of the study may 
be summarized as follows: 

1. Expectancies observed over a series of 
experiences in the same situation were found to 
be significantly lower in highly valued situa- 
tions as compared to situations of lower value. 

2. Though the influence of different goal 
values upon expectancies appears to become 
more uniform with experience, expectancies 
continued to be significantly lower in high 
value conditions. The necessity of investigating 
more extensive amounts of experience than 
those considered here was suggested and 
discussed. 

3. The association of a goal value for 
accuracy (penalty) to expectancy statements 
leads to more realistic expectancies, that is, 
expectancies tend to approach previous per- 
formance more closely. The penalty, however, 
did not appear to eliminate the effect of differ- 
ent goal values upon expectancy. Expectancies 
were found to be relatively lower to high goal 
values, as they had been in situations without 
a penalty. 

Finally, in an attempt to provide a resolution 
of the seemingly contradictory results of pre- 
vious studies in this area, the potential utility of 
a distinction between achievement and non- 
achievement situations was discussed. 
PERSONAL CONSTRUCTS AND PREDICTIVE BEHAVIOR¹ 
Indiana University 


ACCORDING to the psychology of personal 
constructs as recently developed by 
Kelly (3), all men may be thought of 
as scientists in the sense that each is concerned 
with the prediction and control of his environ- 
ment. Further, each individual seems to de- 
velop his own personal repertoire of constructs 
by means of which he structures his world and 
tries to anticipate events. These constructs 
may be thought of as the elements of a system 
by means of which the individual codifies his 
experience. Thus, the psychology of personal 
constructs is concerned with the ways in which 
personal construct repertoires develop and 
change and the ways in which they can be uti- 
lized in accounting for individual behavior. 

Kelly defines a construct as a way in which 
two things are alike and at the same time dif- 
ferent from a third. While more than three 
elements may be involved either in the develop- 
ment or communication of a construct, at least 
three must be present. The construct is con- 
sidered bipolar, one pole designating the basis 
for the similarity and the other pole the con- 
trast. Where persons are used as elements, 
some of the typical constructs found among 
college students are sincere vs. insincere, 
friendly vs. unfriendly, good looking vs. un- 
attractive. 

Among the ways in which personal con- 
structs may be characterized, one dimension 
involves the extent to which the use of the 
construct is independent of or contingent upon 
the use of other constructs in the total con- 
struct repertoire. Where a construct is highly 
related to a large number of other constructs, 
it is termed constellatory. Where the converse is 
true, the construct is termed propositional. In 
Kelly’s terms, “a constellatory construct is one 
which fixes the realm membership of its ele- 
ments—for example, stereotypes: ‘Anything 
which is a ball has got to be . . .’ ” “A propo- 
sitional construct is one which does not disturb 

1 This study is based upon portions of the research 
presented in the writer’s doctoral dissertation sub- 
mitted to the Ohio State University in 1954. Special 
appreciation is extended to Dr. George A. Kelly for 
his advice and guidance. 


the other realm memberships of its elements— 
for example, ‘philosophical attitudes’: ‘Any 
roundish mass can be considered among other 
things as a ball’ . . .” (3, p. 155). 

If Kelly is correct about the role of personal 
constructs in predictive behavior, then where 
predictive failure has occurred, certain changes 
should be evident in the person’s use of his 
constructs. The present paper is concerned 
with this problem in relation to the distinction 
between constellatory and propositional 
constructs. 

There are two ways in which a person’s use 
of his constructs might change. The first would 
consist of shifting the basis for his construction 
of an event or invoking a different construct 
or set of constructs (5). An example would be 
where one had made use of the construct honest 
vs. dishonest in construing his associates and 
upon finding that he was not able to predict 
their behavior very well, shifted to the con- 
struct reliable vs. unreliable. The second form 
of change in construct use is the one studied in 
this paper and may be referred to as recon- 
struction. Here the individual uses the same 
construct but shifts the position of the elements 
or reconstrues the elements. Thus, in the case 
of the construct honest vs. dishonest, if one 
construes another as honest but finds that pre- 
dictions based on this construction are not 
supported, then one might try to raise the 
accuracy of his predictions by reconstruing the 
other person as dishonest. 

Three hypotheses were consequently formu- 
lated for testing in the present study: 

1. Where extensive revisions are required 
in an individual’s predictive behavior (high 
invalidation), a greater amount of reconstruc- 
tion will occur on constellatory constructs than 
on propositional constructs. 

2. Where minor revisions are required in an 
individual’s predictive behavior (low invalida- 
tion), the amount of reconstruction occurring 
on propositional constructs will be greater than 
or equal to the amount of reconstruction oc- 
curring on constellatory constructs. 


3. Constellatory constructions are more 
sensitive than propositional constructions to 
the amount of invalidation occurring for a set 
of predictions. 

The third hypothesis is the most general of 
the three and is in a sense implied in the first 
two. All three are predicated on the assumption 
that constellatory constructs, by virtue of their 
broader range of interdependency, mediate a 
broader range of predictions and hence are 
affected only when major revisions in predic- 
tions are necessary. Because of the relative 
independence of propositional constructs, these 
will be less affected by the extensiveness of 
invalidation. Reconstruction is assumed to 
occur on propositional constructs only in con- 
nection with fairly specific or minor instances 
of predictive failure. 


METHOD 
The Role Construct Repertory Test 


The Role Construct Repertory Test (RCRT) (3) 
was designed to elicit a set of constructs from an indi- 
vidual which research to date (1, 2, 4) indicates has a 
fair degree of stability and generality. The RCRT may 
be used either as an individual or as a group test. In 
the present study, the group form was used. In this 
form, S is first presented with a list of nineteen role 
titles such as, “Your mother, or the person who has 
played the part of your mother”; “your girl (boy) 
friend or closest present girl (boy) friend”; “the per- 
son with whom you usually feel most uncomfortable,” 
etc. For each role title, S must supply the name of a 
different person. The S is next presented with various 
combinations of these names in the form of triads and 
is asked to indicate which two people are alike and 
at the same time different from the third. He is then 
asked to write the way in which the two are alike 
under the heading “Construct” and to write the term 
or phrase which he considers its opposite under the 
heading “Contrast.” Each of these triads is termed a
sort; twenty-two sorts are required of each S. There is 
no restriction on the kinds of terms used or on the 
repetition of terms. 

Following the 22 sorts and the elicitation of con- 
structs, S is finally asked to consider each of the 19 
persons he has listed in terms of each of the constructs 
formed. For each person, he indicates whether the 
construct or its contrast applies. This final procedure 
permits the application of a nonparametric factor- 
analytic procedure and the subsequent identification 
of the relationships among the constructs used by 
each S. A complete exposition of this method may be 
found in Kelly’s book (3). 

The factorial method applied to RCRT protocols 
usually identifies between two and seven factors. Of 
these, one factor, typically the first or second extracted,
is saturated with a larger number of constructs than
any of the others and is referred to as a general factor. 
In the present study the assumption was made that 
all constructs having significant loadings on the same 
factor would be more interdependent with each other
than with other constructs. Thus, constructs having 
significant loadings on a general factor were considered 
to have a wider range of interdependency as compared 
to other constructs. Consequently, all constructs hav- 
ing significant loadings on a general factor were desig- 
nated constellatory; all others were propositional. 
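
The classification rule just described can be sketched in a few lines. This is only an illustration under stated assumptions: the loadings layout, the factor names, and the significance cutoff passed in as `is_significant` are hypothetical, and the sketch does not reproduce the nonparametric factorial method itself, only the labeling step that follows it.

```python
from collections import Counter


def classify_constructs(loadings, is_significant):
    """Label each construct constellatory or propositional.

    loadings: {construct: {factor: loading}} (hypothetical layout)
    is_significant: callable deciding whether a loading is significant
                    (an assumed stand-in for the paper's criterion)
    """
    # Count, for each factor, how many constructs load significantly on it.
    counts = Counter(
        factor
        for by_factor in loadings.values()
        for factor, value in by_factor.items()
        if is_significant(value)
    )
    if not counts:
        raise ValueError("no significant loadings found")
    # The general factor is the one saturated with the most constructs.
    general = counts.most_common(1)[0][0]
    labels = {}
    for construct, by_factor in loadings.items():
        on_general = is_significant(by_factor.get(general, 0.0))
        labels[construct] = "constellatory" if on_general else "propositional"
    return general, labels


# Made-up loadings for four constructs on two factors.
example = {
    "friendly": {"F1": 0.8, "F2": 0.1},
    "sensible": {"F1": 0.7, "F2": 0.2},
    "happy": {"F1": 0.6, "F2": 0.0},
    "athletic": {"F1": 0.1, "F2": 0.9},
}
general, labels = classify_constructs(example, lambda v: abs(v) >= 0.5)
```

In this made-up example the factor `F1` carries three constructs and is treated as general, so `athletic` alone is labeled propositional.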


Elicitation and Classification of Constructs


RCRT’s were obtained from 80 students in two 
sections of a course in educational psychology. Each 
protocol was factored until one general factor was 
extracted which contained eight or more constructs 
with significant loadings. Five constellatory and five 
propositional constructs were then selected from each 
protocol. For constellatory constructs, all those with 
significant loadings on the general factor were rank- 
ordered in terms of the magnitude of their factor 
loadings, and the first five nonrepeating constructs 
were selected. The first five constructs appearing in 
the protocol which did not have significant loadings 
on the general factor and which contained no obvious 
repetitions among them were designated as proposi- 
tional constructs. One additional criterion used in 
selecting constructs was that they had to be applicable 
to persons other than those on whom they were formed, 
i.e., they could not refer to kinship, sex, age, or occupa- 
tional status. Thus, with the possible exception of the 
last criterion, the process of construct selection and 
classification was entirely a nonjudgmental one. 
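
Because the selection rule is mechanical, it can be written out as a short sketch. The data layout (a list of label and loading pairs in order of elicitation), the cutoff function, and the treatment of "repetition" as a case-insensitive match of labels are all assumptions made for illustration; the applicability criterion (no kinship, sex, age, or occupational terms) is not modeled here.

```python
def select_constructs(protocol, is_significant, n=5):
    """Pick n constellatory and n propositional constructs from one protocol.

    protocol: list of (label, loading_on_general_factor) in the order the
              constructs were elicited (hypothetical layout)
    is_significant: assumed cutoff for a significant loading
    """

    def first_distinct(rows):
        # Take labels in the given order, skipping repeated labels.
        chosen, seen = [], set()
        for label, _ in rows:
            key = label.strip().lower()
            if key in seen:
                continue
            seen.add(key)
            chosen.append(label)
            if len(chosen) == n:
                break
        return chosen

    significant = [row for row in protocol if is_significant(row[1])]
    # Constellatory: rank-order by magnitude of loading (stable sort).
    ranked = sorted(significant, key=lambda row: abs(row[1]), reverse=True)
    # Propositional: order of appearance among the remaining constructs.
    others = [row for row in protocol if not is_significant(row[1])]
    return first_distinct(ranked), first_distinct(others)


demo = [
    ("happy", 0.9), ("quiet", 0.2), ("Happy", 0.8),
    ("sensible", 0.7), ("quiet", 0.1), ("tall", 0.3),
]
constellatory, propositional = select_constructs(
    demo, lambda v: abs(v) >= 0.5, n=2
)
```

A protocol would be dropped, as in the study, when either returned list has fewer than `n` entries.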

Out of the original set of 80 protocols, 25 were elimi- 
nated either because the general factor contained more 
than 17 constructs, thus not permitting the selection 
of the five propositional constructs, or because too few 
of the constructs elicited met the criteria of applica- 
bility. The participation of the 55 remaining Ss was 
solicited for the experiment proper; of these, 40 (33 
females and 7 males) volunteered. 


Individual Sessions 


Each S was presented with photographs of two 
persons. Their order of presentation was systematically 
alternated over Ss. The two photographs used were of 
male graduate students taken when each was about 
18 years of age. Students were told that the experi- 
ment was concerned with the accuracy with which 
personality could be judged from photographs. They 
were told that this was a skill which everyone possessed
to some degree and that in this experiment E
was interested in seeing how well they could utilize 
this skill. The Ss were then told that E would be 
interested in how they described each person in their 
own terms (their constructs) and how accurately they 
could answer fifteen questions about each person’s 
behavior. 

For their constructions each S was presented with 
a blank on which his five constellatory and five propo- 
sitional constructs, both construct and contrast, had 
been typed so as to represent two alternatives. The 
sequence of occurrence of each type of construct was 
the same for all Ss with constellatory constructs located 
at ordinal positions 1, 3, 4, 5, and 9 in the list.

The prediction questions which Ss were to answer 
were of the following type: 
TABLE 1
MEAN NUMBER OF CHANGES IN CONSTRUCTION OF
CONSTELLATORY AND PROPOSITIONAL CONSTRUCTS
UNDER CONDITIONS OF HIGH AND
LOW INVALIDATION

                             Construct
Conditions           Constellatory  Propositional     t       p*

High Invalidation        1.900          1.550       2.800    .01
Low Invalidation          .775           .900        .862    ns
Difference               1.125           .650       2.387    .05

* All probabilities are for a two-tailed test since one of the
problems under investigation was whether the classification of
constructs as constellatory or propositional was related in any
way to changes in these constructs under conditions of high and
low invalidation.


1. With regard to going to parties, this person
usually:
a. Jumps at the opportunity.
b. Regards them as a “bore” and goes rarely.
c. Will decide whether to go on the basis of who
will be there.
d. Regards them as a “bore” but goes anyway.
e. Just isn’t interested.
2. When he thinks about the future, he:
a. Is full of wonder and enthusiasm.
b. Has doubt about his ability.
c. Can see mostly calamity.
d. Is full of plans—has it all worked out.
e. Can’t get very interested—more concerned
with the present.

The two tasks were presented as distinct from each 
other and on separate sheets of paper. As S finished 
each set of constructions and predictions, E pretended 
to compare them with fictitious data on the person 
involved. When constructions and predictions had 
been made for both persons, E told S that on this trial 
he had only checked on the accuracy of his predictions 
and had not looked at his descriptions. The E then
informed S that on one of the persons he had done 
rather poorly, below the mean in accuracy for his 
group (high invalidation) while on the other person 
he had done quite well, scoring above the mean al- 
though missing a few items (low invalidation). The 
Ss were not told either which or how many items they 
had presumably missed. 

The Ss were then told that although they had not 
been given any more specific information than how 
accurate they were in their predictions, they were to 
try to improve on their accuracy of prediction by 
going through the process of prediction and descrip- 
tion again. Their first set of constructions and predic- 
tions was then returned to them and they were given 
a new set of blanks. It was suggested to each S that 
although his descriptions had not been checked this 
time, they would be checked next along with his 
revised predictions, and that he might wish to change 
some of them. This then yielded a second set of constructions
and predictions, in one instance following
high invalidation and in the other, low invalidation. 
The actual order in which high and low invalidation 
occurred was systematically alternated from S to S. 


RESULTS 


From Table 1 it will be seen that the null 
hypothesis concerning differences in amount 
of reconstruction on constellatory and propo- 
sitional constructs under conditions of high 
invalidation may be rejected at the .01 level of 
confidence. This provides support for the first 
experimental hypothesis that under conditions 
of high invalidation, a greater amount of recon- 
struction occurs of constellatory than of propo- 
sitional constructs. Under conditions of low 
invalidation, the difference in amount of recon- 
struction on constellatory and propositional 
constructs is not significant. However, the 
second experimental hypothesis predicted that 
under these conditions, the extent of recon- 
struction of propositional constructs would be 
greater than or equal to the reconstruction of 
constellatory constructs. Consequently, the 
experimental hypothesis is not necessarily 
denied by these null findings. Finally, the third 
experimental hypothesis that constellatory 
constructions are more sensitive than propo- 
sitional constructions to degree of invalidation 
is supported by the significant difference be- 
tween differences in amount of reconstruction 
of constellatory and propositional constructs 
following high and low invalidation. 
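
The "difference between differences" that bears on the third hypothesis can be retraced from the means in Table 1. The sketch below is only an arithmetic check on those published means; the variable names are illustrative, and the t tests themselves cannot be recomputed without the individual scores.

```python
# Mean number of changes, as read from Table 1.
table1 = {
    "high": {"constellatory": 1.900, "propositional": 1.550},
    "low": {"constellatory": 0.775, "propositional": 0.900},
}


def sensitivity(kind):
    """Change in mean reconstruction from low to high invalidation."""
    return round(table1["high"][kind] - table1["low"][kind], 3)


# Difference between the two sensitivities (the interaction contrast).
interaction = round(
    sensitivity("constellatory") - sensitivity("propositional"), 3
)
```

The two sensitivities reproduce the "Difference" row of Table 1 (1.125 and .650), and their difference, .475, is the quantity evaluated by the third test.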

In each instance, then, some support was 
found for the predictions made regarding the 
relationships between amount of invalidation, 
kinds of constructs, and amount of reconstruction.
It would thus appear that, in the sense of
mediating differential predictions, both the 
conceptual distinction between constellatory 
and propositional constructs and the operations 
used in making this distinction are meaningful. 
Further, if one accepts the degree of invalida- 
tion needed to bring about a given amount of 
reconstruction as an indication of the suscep- 
tibility to reconstruction of that construct, the 
data suggest an inverse relationship between 
susceptibility to change and range of inter- 
dependency. 

It will be recalled that the method by which 
constellatory and propositional constructs were
distinguished was essentially a mechanical one. 
In order to determine whether such a procedure 
was necessary, or whether the distinction 
TABLE 2
TYPICAL CONSTELLATORY AND
PROPOSITIONAL CONSTRUCTS

Constellatory Constructs

1. Military mind vs. few strong concepts
2. Conceit vs. unconceited
3. More reserved vs. happy-go-lucky
4. Very active vs. very lazy
5. Loquacious—easy to know vs. introvert—quiet
6. Mean to me vs. nice to me
7. Unfrivolous vs. one who “puts on”
8. Sensible vs. not too sensible
9. Happy vs. dissatisfied
10. Don’t give up vs. always give up

Propositional Constructs

1. Intelligent vs. average
2. More carefree vs. more responsible
3. Understanding vs. not understanding
4. Team work vs. egotist
5. Very reserved in manners vs. happy-go-lucky
6. Self-centered vs. not quite so much
7. Very conservative vs. liberal minded
8. More serious vs. less serious
9. Always making jokes vs. dead pan
10. There’s room for improvement vs. has reached goal of ethics

might be made just as well on some intuitive
basis, six advanced graduate students, all of 
whom were familiar with Kelly’s theory, were 
given a list of forty constructs taken at random 
from RCRT protocols and were asked to deter- 
mine which were constellatory and which 
propositional. The judges were informed that 
half of the constructs belonged in each cate- 
gory. Table 2 presents a random sample of the 
constructs contained in this list. None of the 
judges was able to do better than chance. 
Essentially, this would suggest that whether a 
given term is constellatory or not depends 
strongly upon how the individual uses it rather 
than upon its semantic meaning. This 
functional character of constructs can only be 
determined by some procedure such as Kelly’s 
which reflects individual usage. 
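
To give a sense of what "better than chance" would have required of a judge sorting forty constructs, the following sketch computes a binomial tail probability. It is an illustration only: the binomial model with p = .5 is an assumption (judges who used the hint that half the constructs fall in each category would strictly follow a hypergeometric model), and the cutoffs tried are hypothetical, since the judges' actual scores are not reported.

```python
from math import comb


def chance_tail(n_items, n_correct, p=0.5):
    """Probability of at least n_correct right out of n_items by guessing."""
    return sum(
        comb(n_items, k) * p**k * (1 - p) ** (n_items - k)
        for k in range(n_correct, n_items + 1)
    )


# Hypothetical cutoffs for a judge sorting 40 constructs.
p_at_least_25 = chance_tail(40, 25)
p_at_least_26 = chance_tail(40, 26)
```

Under these assumptions a judge would need about 26 of the 40 constructs correct before the one-tailed probability of doing so by guessing fell below .05.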


DISCUSSION 


It is possible to view the personal construct 
repertoire as a coding system containing a finite 
number of elements or constructs. If the pur- 
pose of such a system is to aid in the discrimina- 
tion and anticipation of events, then one would 
expect that its efficiency would decline as the 
independence between the elements decreased. 
Thus, it would be expected that the individual
whose construct repertoire comprises primarily 
constellatory constructs would be less sensitive 
to the subtle nuances of interpersonal situa- 
tions. For the same reason, one would expect 
communication to be more difficult and less 
efficient as the number of constellatory con- 
structs employed by the individual increased. 

These considerations raise the normative 
problem of the relative proportions of constel- 
latory and propositional constructs desirable in 
a construct repertoire. If it is the degree of 
relationship between two constructs which 
permits the individual to formulate proposi- 
tions of the form “If x then y,” e.g., “If this 
person is friendly then he can be trusted,” then 
complete independence between all of the con- 
structs in an individual’s construct repertoire 
would leave him completely at sea in terms of 
making predictions and generalizations. On 
the other hand, if all constructs were perfectly 
correlated with each other, again the individual 
would be at a loss in dealing with a changing or 
probabilistic world. While neither of these ex- 
tremes seems likely of occurrence, the question 
can be posed as to whether there is an optimal 
point between them. 

In attempting to bring about some change in 
an individual’s mode of response or general ad- 
justment, the therapist or educator makes use 
of a variety of techniques, all of which seem to
rely upon some principle of differential rein- 
forcement. The present findings suggest that 
these procedures may vary in their effective- 
ness with the extent to which the elements in- 
volved are constellatory or propositional. It is 
quite probable that whenever the therapist or 
client ascribes failure to change as being due to 
“too much at stake” in the change, they are 
referring to a situation which has been con- 
strued in a constellatory fashion. Thus, a 
knowledge of the content and structure of an 
individual’s personal construct system and a 
set of procedures which might reduce the con- 
stellatory nature of certain constructs would 
seem to be important in psychotherapy or any 
other situation where our aims involve changes 
in perception and behavior. 

With respect to the latter point, many of the 
procedures now used by therapists may have 
as one of their effects the reduction of constel- 
latory constructs. One might mention as one 
prominent example the technique of transference
analysis. Here the client has transferred to
the therapist an entire constellation of con- 
structions which he had formerly applied to 
some other important figure in his life, e.g., 
his father. In analyzing the client’s transference 
reaction, the therapist may be helping the 
client to use such constructs as male vs. female, 
authority vs. nonauthority, dominating vs. not 
dominating, hostile vs. not hostile, arbitrary 
vs. not arbitrary or understanding, in a more 
propositional way, thus permitting him to 
establish role relationships with other males or 
authority figures where he does not continually 
anticipate that (and hence act as though) he 
will be aggressed against, punished, etc. Fur- 
ther research might profitably be directed to- 
ward the development and assessment of 
techniques aimed at reducing the constellatory 
nature of constructs. 


SUMMARY 


Starting with the assumption of the psychol- 
ogy of personal constructs that each individual 
develops and makes use of a set of constructs 
in attempting to predict and control his environment,
the effects of apparent failure in prediction
were studied in relation to two kinds of
construct (constellatory and propositional) 
differentiated on the basis of their range of 
interdependency. Three specific hypotheses 
were tested and each found experimental support.
From these findings, it might be concluded
that there is an inverse relationship
between the range of interdependency of a
construct and its susceptibility to change following
predictive failure. Several possible
implications of these findings are discussed.


HUMOR AND ANXIETY


JOHN DORIS AND ELLA FIERMAN


Yale University 


Following Kris’ lead (5, 6) various psychoanalytically
oriented writers (4, 7, 8, 10)
have pointed out that Freud’s position as
stated in Wit and Its Relation to the Unconscious
implies a relationship between humor and anx- 
iety. These writers have emphasized the aggres- 
sive and sexual, as well as the nonsensical, 
aspects of humor. This literature raises the ex- 
perimental question as to whether subjects 
differentiated on the basis of a self-rated
anxiety scale respond differently to humorous 
stimuli. 


METHOD 


A self-rated general anxiety questionnaire¹ was administered
during regular class sessions of introductory
psychology courses to 419 Yale men and 197 Connecticut
College women.² From the distributions of
scores on this questionnaire, for each of the respective 
colleges, extreme groups of high anxious (HA) and 
low anxious (LA) subjects (Ss) were selected. The 
Yale Ss were selected from within the upper and lower 
6 per cent of their distribution and the Connecticut Ss 
were selected within the upper and lower 9 per cent. 
There were 56 Ss in all, 28 HA and 28 LA. Each 
anxiety group contained equal numbers of men and 
women. The Ss were not told that their selection 
for participation in the humor section of the study 
was based upon the anxiety questionnaire that they 
had filled out several months previously. Participation 
in the humor study was on a volunteer basis but of 
all students approached only three refused to serve. 

The humor stimuli were cartoons selected from the 
Mirth Response Test (10) which consists of cartoons 
from The New Yorker and similar magazines.³ The
cartoons of the Mirth Response Test were given to a 
group of 15 psychiatrists, psychologists, and psychi- 
atric social workers who rated each cartoon on the 
amount of aggression, sex, and nonsense in the cartoon. 
The ratings were made on graphic scales which were 
descriptively labeled to indicate increasing amounts 


1 This questionnaire is a revision of the Questionnaire
on Adult Forms of Anxiety and Worry used by
Gordon and Sarason (2). Copies may be obtained
from the present authors upon request.

2 The authors wish to express their thanks to Dr. 
S. B. Sarason of Yale and Dr. M. Applezweig of the 
Connecticut College for Women for their cooperation 
in conducting the experiment at their respective 
schools. 

3 The authors would like to express their thanks
to Dr. F. C. Redlich and Dr. J. Levine of the Yale 
Psychiatry Department for their permission to use 
materials from the Mirth Response Test and for their 
generous cooperation in making this study possible. 
of the characteristic being rated, and subsequently
scored in equal linear units from 1 through 9. The 
ratings were then averaged for each cartoon and three 
groups of six cartoons each were selected from the 
original 31, each composed of cartoons in which either 
the sexual, aggressive, or nonsensical aspect of humor 
was respectively predominant. Thus, the six aggressive 
cartoons received a mean rating of 7.8 on aggression,
1.4 on sex, and 5.9 on nonsense. The sexual cartoons 
were rated 3.6 on aggression, 7.0 on sex, and 5.4 on 
nonsense. Ratings on the nonsensical cartoons were 
3.2 on aggression, 1.4 on sex, and 6.8 on nonsense. 
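
The selection criterion, that one aspect be predominant in each set, can be checked against the mean ratings just quoted. The sketch below is a hedged illustration: it operates on the three reported set means rather than on the individual cartoon ratings, which are not given, and the dictionary layout and function name are hypothetical.

```python
# Mean judge ratings (scale 1 through 9) for each selected set of cartoons,
# as reported in the text.
set_means = {
    "aggressive": {"aggression": 7.8, "sex": 1.4, "nonsense": 5.9},
    "sexual": {"aggression": 3.6, "sex": 7.0, "nonsense": 5.4},
    "nonsensical": {"aggression": 3.2, "sex": 1.4, "nonsense": 6.8},
}


def predominant_aspect(ratings):
    """Return the rated aspect with the highest mean rating."""
    return max(ratings, key=ratings.get)


predominant = {
    name: predominant_aspect(ratings) for name, ratings in set_means.items()
}
```

Each set's highest mean falls on its intended aspect, although nonsense is rated fairly high in all three sets, so the categories are not pure.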

In the experimental situation the cartoons were 
presented to S one at a time in a standardized order 
that intermingled the various content categories of 
aggression, sex, and nonsense. The S’s vocal and facial 
responses to the cartoons were rated by the examiner 
on a 6-point scale from disapproval to pronounced 
pleasure.⁴ After the S had examined all cartoons he
was asked to go over them again indicating his degree 
of preference on a graphic rating scale extending from 
Very Much Disliked to Very Much Liked. The S was 
then asked to describe the point of each joke. The 
examiner (E) wrote down this description and it was
later scored for comprehension of the cartoon. 

There were two examiners, one male and one fe- 
male. Each examiner tested 28 Ss: 7 men and 7 women 
from each of the anxiety groups. An effort was made 
to select the Ss in such a fashion that at the time of 
testing the examiner would not know whether the S 
being tested belonged in the HA or LA group. With
the exception of four cases this was possible. 


RESULTS 


The Ss’ explanations of the cartoons were first
scored for comprehension according to criteria 
established on the basis of the investigators’ 
previous experience with the cartoons. These 
criteria, with minor modifications, were similar 
to those recommended by the originators of the 
Mirth Response Test. The investigators were 
able to score independently a sample of 
144 cartoon interpretations with 95 per cent 
agreement. 

Scoring of the cartoon interpretations revealed
that approximately 85 per cent of the Ss
misunderstood two of the sexual cartoons.⁵ It
was decided to remove these cartoons before
scoring the preferences of the Ss.⁶ The point of
the joke being missed, it seemed invalid to compare
Ss on their appreciation of the humor; and
since Ss in HA and LA and male and female
categories had equal difficulty in comprehension,
it was felt that there was no connection
between failure to understand these cartoons
and the major variables and categories of interest.
For the remaining cartoons there were only
a few scattered cases of misinterpretation
equally divided between groups, and for purposes
of further analysis these few instances
were treated exactly as the rest of the data. It
was assumed, and inspection of the data lent
support to the assumption, that the preference
scores of the Ss in these few instances did not
materially distort the means of the groups.

For each of the cartoons retained for further
analysis, S had a rating from 0 to 15 representing
his preference on the graphic scale and a
rating from 0 to 5 based on E’s judgment of his
mirth response. The respective means of these
ratings on each set of sexual, aggressive, and
nonsensical cartoons, and on all 16 cartoons
combined, were considered to be S’s preference
and mirth-response scores. Comparisons between
the anxiety groups were then made on the
bases of these scores.


4 In an attempt to estimate the reliability of the
mirth-response ratings, six Ss other than those used
in the experiment were asked to look through the 31
cartoons of the Mirth Response Test while the authors
made independent judgments of their mirth responses.
The reliability coefficients of the authors’ judgments
for these six Ss were respectively .80, .76, .90, .45,
.74, .83.

5 A Cobean cartoon with his familiar “undressing”
theme and a Kovarsky cartoon satirizing this theme.
These cartoons required certain knowledge that the S
group may not have had at its disposal. Understanding
of the Cobean cartoon required that one be able to
recognize the appearance of an oriental eunuch. The
Kovarsky cartoon required that one have some familiarity
with the Cobean “undressing” theme.

6 The mean ratings by psychiatrists, psychologists,
and social workers for the four retained sexual cartoons
were 7.84 on aggression, 7.08 on sex, and 6.84 on nonsense.


TABLE 1
MEAN PREFERENCE SCORES FOR CARTOONS BY
ANXIETY GROUPS
(N = 28 in each group)

                Nonsensical      Aggressive       Sexual      All
                Cartoons         Cartoons         Cartoons    Cartoons

High Anxious        8.7              9.2             8.0          8.7
Low Anxious         9.3             10.2             8.2          9.4
t*                  1.59             2.25            0.35         1.95
p              .20 > p > .10    .05 > p > .02        --      .10 > p > .05

* Tests for homogeneity of variance were done for all t tests
reported in this study. In no case was rejection of the hypothesis
of a common population variance indicated.


TABLE 2
MEAN PREFERENCE SCORES FOR CARTOONS BY ANXIETY GROUPS SUBDIVIDED
ACCORDING TO EXAMINER-SUBJECT SEX COMBINATION
(N = 14 in each group)

                  Nonsensical          Aggressive           Sexual               All
                  Cartoons             Cartoons             Cartoons             Cartoons
                Opposite    Same     Opposite    Same     Opposite    Same     Opposite    Same
                Sex         Sex      Sex         Sex      Sex         Sex      Sex         Sex

High Anxious      8.4       9.0        9.2       9.3        7.7       8.2        8.6       8.9
Low Anxious       9.5       9.1       11.0       9.4        8.2       8.2        9.7       9.0
t                 2.09      --         2.86      --         0.56      --         2.44      --
p            .05>p>.02      --    .01>p>.001     --         --        --    .05>p>.02      --



In terms of the average preference scores 
(Table 1) the HA group showed a tendency 
(.10 > p > .05) to give lower average ratings 
than the LA group on all types of cartoons com- 
bined. On the aggressive cartoons the preference 
scores of the HA group were lower (.05 > p > 
.02) than those of the LA group. There were no 
reliable differences between the groups on the 
nonsensical and sexual cartoons. 

Closer examination of the data revealed, how- 
ever, that not all members of the HA and LA 
groups were contributing equally to the ob- 
tained differences. Rather, it was those Ss tested 
by an examiner of the opposite sex that con- 
tributed the major part of the difference. The 
Ss were therefore classified not only according 
to their anxiety grouping but also according to 
the conditions of administration. This analysis 
is presented in Table 2. In the opposite condition 
the male E administered cartoons to female Ss 
and the female E administered cartoons to male 
Ss. In the same condition the male E admin- 
istered cartoons to male Ss and the female E 
administered cartoons to the female Ss. It is 
TABLE 3
MEAN MIRTH-RESPONSE SCORES FOR CARTOONS BY
ANXIETY GROUPS WITH EXAMINER AND SUBJECT
OF OPPOSITE SEX
(N = 14 in each group)

                Nonsensical     Aggressive    Sexual      All
                Cartoons        Cartoons      Cartoons    Cartoons

High Anxious       3.30            3.40         3.51        3.39
Low Anxious        3.83            3.79         3.53        3.74
t                  2.18            1.28         --          1.46
p             .05 > p > .02        --           --          --


now noted that differences between HA and LA 
Ss exist only in the opposite condition for the 
nonsensical (.05 > p > .02), aggressive (.01 > 
p > .001), and all cartoons combined (.05 > 
p > .02). No significant difference is found in 
the same condition for nonsensical, aggressive, 
and all cartoons, nor in either condition for 
sexual cartoons. 
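
Since each anxiety group of 28 is split into two equal subgroups of 14, the means in Table 1 should equal the average of the corresponding opposite-sex and same-sex means in Table 2, within rounding. The following sketch performs that consistency check on the published means; the data structures and the tolerance of 0.1 (an allowance for one-decimal rounding in both tables) are assumptions of the illustration.

```python
# Order of columns: nonsensical, aggressive, sexual, all cartoons.
table1 = {
    "HA": [8.7, 9.2, 8.0, 8.7],
    "LA": [9.3, 10.2, 8.2, 9.4],
}
# Table 2 means as (opposite-sex examiner, same-sex examiner) pairs.
table2 = {
    "HA": [(8.4, 9.0), (9.2, 9.3), (7.7, 8.2), (8.6, 8.9)],
    "LA": [(9.5, 9.1), (11.0, 9.4), (8.2, 8.2), (9.7, 9.0)],
}

TOLERANCE = 0.1 + 1e-9  # allowance for one-decimal rounding


def pooled(pair):
    """Mean of two equal-sized subgroups."""
    return (pair[0] + pair[1]) / 2


consistent = all(
    abs(pooled(pair) - overall) <= TOLERANCE
    for group in table1
    for pair, overall in zip(table2[group], table1[group])
)

# HA minus LA under each condition, for each cartoon category.
gap_opposite = [
    round(ha[0] - la[0], 1) for ha, la in zip(table2["HA"], table2["LA"])
]
gap_same = [
    round(ha[1] - la[1], 1) for ha, la in zip(table2["HA"], table2["LA"])
]
```

The pooled subgroup means agree with Table 1, and the gaps make the pattern in the text visible: the HA group falls below the LA group by 1.1 to 1.8 points in the opposite condition on the nonsensical, aggressive, and combined scores, but by no more than 0.1 in the same condition.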

Mirth-response scores were treated in the 
same manner as preference scores. There were 
no differences between total HA and LA groups 
in the average mirth-response scores. Since 
analysis of the data according to conditions of 
administration proved enlightening for the 
preference scores, however, a similar analysis 
was carried out for the mirth-response scores, 
which revealed a rather similar pattern to that 
obtained with the preference scores. There were 
no differences between the anxiety groups in the 
same condition of administration while there 
was some tendency of the HA group to score 
lower than the LA group in the opposite con- 
dition. However, only in the case of the non- 
sensical cartoons (.05 > p > .02) was the 
difference significant. Table 3 reproduces only 
the results for the opposite condition for the
mirth-response scores. All differences for the 
same condition were negligible. 


DISCUSSION 


Two results seem to stand out in this experi- 
ment. First, for this group of Ss, there seems to 
be a relationship between S’s rating of his sus- 
ceptibility to anxiety as determined by a general 
anxiety scale and his preference rating for car- 
toons of aggressive content. Second, this rela- 
tionship depends upon the social context of the 
humor stimuli; i.e., results depend upon which 
S is tested by which E. 
This interrelationship of anxiety, preference
rating, and social context also seems to be present
in the case of nonsensical cartoons, although
evidence is not as clear-cut as with the aggres- 
sive cartoons. In the case of the sexual cartoons, 
there was no differentiation of HA and LA 
groups under any condition. It may be that the 
smaller number of sexual cartoons made for less 
reliable measurement. Further study is neces- 
sary to explain this failure of the sexual cartoons 
to differentiate. 

These results suggest that further study of 
the relationship between humor and anxiety 
may be profitable. They strongly indicate, how- 
ever, that such experimentation should not 
focus exclusively on the stimulus materials, i.e., 
jokes or cartoons, per se. The social context 
appears to be crucial. In this, the results parallel 
those of Perl (9), who demonstrated that Ss 
rated jokes differently when tested alone than 
when tested in a group. In our study, the person- 
alities of the Es must have provided cue differ- 
ences which altered the situation for S. Stand- 
ardized administration procedures and E’s lack 
of knowledge concerning the anxiety classifica- 
tion of Ss during testing suggest that these cue 
differences were not the result of E’s consciously 
altering his behavior for the different groups of 
Ss.⁷ While the exact nature of these cue differences
cannot be determined on the basis of this
experiment, the sex difference would seem most 
immediately obvious to the Ss, and appears 
worth further investigation. Such an investiga- 
tion would, of course, require a more representa- 
tive sample of experimenters than the design of 
this study permitted (3). 

That the mirth-response score did not dis- 
criminate as well as the preference score is 
of interest in view of the fact that other investi- 
gators (7) found a mirth-response score ade- 
quate for discrimination within a patient 
population of HA and LA Ss who were selected 
on the basis of E’s estimate of their anxiety level 
while in the test situation. Further study would 
seem to be indicated to determine the relative 
merits of these two scores in humor studies, as 
well as the alternative techniques of selecting 
anxiety groups. 


7 During the test administration the examiners at- 
tempted to classify the Ss they saw as either HA or 
LA. Their success in this classification did not exceed 
chance expectations. 


FACTORS IN THE SERIAL RECALL OF NAMES OF ACQUAINTANCES¹


RUE L. CROMWELL 


George Peabody College for Teachers 


The studies reported here examine from a
novel perspective the influence on serial
recall of several factors, most of them
familiar. Conventionally, the process of recall 
is investigated either by experimentally vary- 
ing the conditions of learning and reinstate- 
ment, on the one hand, or by soliciting the un- 
controlled recollection of life experiences, on 
the other. The present studies attempt to pre- 
serve the advantage of the more naturalistic 
kind of investigation in which the learning 
underlying recall has occurred in the normal 
course of life experience, while retaining a de- 
gree of control more characteristic of experi- 
mental studies. In each case, the task set the 
subject involves recalling the names of ac- 
quaintances that fit certain specifications. The 
order of recall is noted. After recall, the subject 
is requested to rank the names with respect 
to the variables under investigation. 

In the first study, five variables are exam- 
ined: frequency, recency, and pleasure, from 
associationism and hedonistic theories; degree 
of acquaintance (on common-sense grounds); 
and a factor drawn from the personal construct 
theory of Kelly (3). The second study serves a 
dual function. While it is primarily designed to 
check findings obtained in the previous study, 
it also permits a more extended analysis of the 
pleasure variable. 

STUDY I

The hypotheses underlying the first study,
stated in terms of the recall-ranking situation,
are as follows: 

1. Pleasure hypothesis. The order of recall is
positively correlated with the order in which 
the recalled elements are ranked in terms of 
pleasure. The rationale for this hypothesis can 
be drawn from the hedonistic formulations of 
psychoanalysis. According to Freud there is 
“the working of a tendency to ward off from 
memory that which is unpleasant” (2, p. 68). 


1 This article is based on a master’s thesis at The 
Ohio State University and a follow-up experiment. 
Advice and criticism on various phases of the study were 
received from George A. Kelly, Paul Mussen, Richard 
Lundy, Robert J. Wherry, Raymond Norris, and 
Rupert Klaus. 


More “pleasant” objects, by this hypothesis,
should be recalled early in a recall sequence,
less “pleasant” ones later on, and very “painful”
ones “repressed” and therefore unavailable
to immediate recall.

2. Frequency hypothesis. The order of recall 
is positively correlated with the frequency of 
contact with the recalled elements. 

3. Recency hypothesis. The order of recall is 
positively correlated with the recency of con- 
tact with the recalled elements. 

4. Acquaintance hypothesis. The order of re- 
call is positively correlated with the degree of 
the S’s acquaintance with the recalled elements. 
This hypothesis is offered on the common-sense 
grounds that people recall best what they 
“know” best.

5. Construction hypothesis. The items re- 
called adjacently (in terms of sequence) are 
grouped by the S as similar to each other to a 
degree better than chance. The personal con- 
struct theory of Kelly (3) suggests that when 
an individual recalls a number of items from 
past experience, he recalls them in groups as a 
function of his own manner of classification or 
construction. When the recalled items are 
names of people, an S might recall college ac- 
quaintances, relatives, neighbors, and then 
teachers. In this way, items adjacent to each 
other in sequence of recall may tend to be 
grouped by the S as similar in contrast with 
nonadjacent recalled items. 


Method 


Subjects. Five elementary psychology classes, 151 
students in all, were used in Study I. The classes, 
approximately equal in size, were respectively assigned 
to the five experimental groups. Through failure to 
recall the eight names required in the instructions, 14 
cases were discarded. 

Materials. Each S was given a dittoed data sheet 
form and eight blank 244” x 3” cards. A blackboard was 
used to illustrate the instructions to each group. 

Instructions. The instructions, read verbatim to each 
of the five experimental groups, gave directions to 
perform the following tasks. 


1. The Ss were told they would be asked to recall, 
and write down in order as they recalled, the names of 
eight living acquaintances. Each of these acquaintances
was to have the same first name, which would be 
TABLE 1
MEAN RANKINGS FOR THE FOUR RANKED VARIABLES

Ranking       P       F       C       A

First        3.36    3.39    3.58    3.20
Second       3.88    3.7     4.28    4.05
Third        4.31    4.45    4.10    4.39
Fourth       4.38    4.62    4.56    4.33
Fifth        5.11    4.55    4.34    4.89
Sixth        4.96    4.77    4.80    4.62
Seventh      4.93    5.14    5.11    5.41
Eighth       5.07    5.36    5.23    5.11

Note. P stands for pleasure; F, frequency; C, recency; and A,
acquaintance.


TABLE 2
INTERCORRELATIONS AMONG THE RANKED VARIABLES
AND THE ORDER OF RECALL

                         R      P      F      C      A

Order of recall (R)      x
Pleasure (P)            .91     x
Frequency (F)           .96    .89     x
Recency (C)             .94    .82    .91     x
Acquaintance (A)        .90    .93    .91    .88     x


specified by the experimenter. The Ss were to write
each acquaintance’s name on a blank card, which was 
numbered on the reverse side to indicate the order of 
recall. As soon as Ss indicated they had understood 
these instructions, a common American first name was 
announced and they were allowed 20 minutes to recall 
and write down the names of eight people with this 
first name. Group I was given “John;” Group II was 
given “Mary;” Group III, “James” or “Jim;” Group 
IV, “Jane;” Group V, “William.” 

The order of the following steps (2 to 6) was systematically
varied from group to group. The order given
below was presented to Group I only.

2. The Ss were asked to rank the eight recalled
names in terms of “how well you like the persons you
have recalled.” The S then recorded this ranking on
his data sheet by referring to the card number in terms
of order of recall.

3. The Ss were then asked to rank the recalled
persons “in terms of how much contact you have had
with these persons during your life.” This ranking was
likewise recorded on the data sheet.

4. Next, the Ss were asked to rank and record the
recalled names “in terms of how long it has been since
you have had contact with each person,” the most
recent being ranked first.

5. Then, the Ss were asked to rank and record the
cards “in terms of how well you know each person,” the
best known person being first.

6. The Ss were then told: “Group the cards into
three groups in terms of how they are alike in some way.
Put any number in each group. What basis you have
for the distinction is up to you.” The group memberships
were recorded with respect to the initial order of




TABLE 3
PARTIAL CORRELATIONS BETWEEN THE RANKED
VARIABLES AND THE ORDER OF RECALL

                 Partial correlation
                 with the order           Significance level
Variable         of recall                (df = 3)

Pleasure         $r_{pr.fca} = .61$       p > .20 < .30
Frequency        $r_{fr.pca} = .61$       p > .20 < .30
Recency          $r_{cr.pfa} = .68$       p > .20 < .30
Acquaintance     $r_{ar.pfc} = -.4$       p > .40 < .50


recall for each name. The “bases for the distinctions” 
were also written on the data sheet. At this point the 
S’s data sheet, now complete, contained four rankings 
and one grouping. 

Controls. The “first name” stimulus word and its 
designated sex were varied from group to group. In 
order to reduce or compensate for ranking habits, the 
order of the ranking tasks was systematically varied 
from group to group, the recalled names were put on 
separate cards, the cards were numbered on the reverse 
side only, and they were shuffled by the Ss after each 
ranking and grouping was performed. 

Proper names were used as items for recall because 
they conveniently fit the procedure. The S was not 
allowed to use his own name or the names of deceased 
persons, to avoid cultural influences demanding acceptable
ways of reacting to self and deceased.


Results 

The rankings on the several variables were 
first summated throughout the 137 Ss, and 
then mean ranks were computed. A single 
table of mean ranks (Table 1) was therefore 
obtained, rather than the many individual 
ranks. 

To evaluate the magnitude of the trends ap- 
parent in Table 1, product-moment correla- 
tions were computed between the mean ranks 
for each ranked variable and the (mean) order 
of recall.² Each of the four correlations, presented
in the first column of Table 2, was significantly
above zero with probability levels
beyond the .01 level. The reader is reminded
that these correlations describe the smoothed- 
out group trend and are consequently much 
higher than individual rank-difference correla- 
tions would have been. 
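
The group-level correlations can be re-derived from the printed means. The sketch below is a minimal illustration, not the author's computation: it pairs three columns of Table 1 with recall positions 1 through 8 and applies the ordinary product-moment formula. The frequency column is left out because one of its entries is not fully legible in Table 1; the function and variable names are assumptions introduced here.

```python
from math import sqrt

# Mean ranks copied from Table 1, in row order First ... Eighth.
# Column F is omitted because one printed entry is incomplete.
TABLE_1 = {
    "P": [3.36, 3.88, 4.31, 4.38, 5.11, 4.96, 4.93, 5.07],
    "C": [3.58, 4.28, 4.10, 4.56, 4.34, 4.80, 5.11, 5.23],
    "A": [3.20, 4.05, 4.39, 4.33, 4.89, 4.62, 5.41, 5.11],
}


def pearson_r(x, y):
    """Product-moment correlation between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


positions = list(range(1, 9))  # order of recall, 1 = recalled first
correlations = {
    name: pearson_r(positions, column) for name, column in TABLE_1.items()
}

for name, r in correlations.items():
    print(name, round(r, 3))
```

Run on the rounded means, this gives roughly .91 for pleasure and .94 for recency, in line with the first column of Table 2, and about .91 for acquaintance against the printed .90; a difference of that size is what one would expect from working with two-decimal means.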

Since the ranked variables correlate with 
each other as well as with the order of recall 
(Table 2), partial correlations were computed 


2 It was found that reranking the mean ranks and
also scaling the ranks for normality brought correlations
which were not appreciably different from the product-moment
correlations.







TABLE 4
EXPECTED AND OBSERVED NUMBER OF PEOPLE
NAMING “ISOLATES”

Number of      Expected       Observed
isolates       frequency      frequency

0                 2.01             4
1                 9.10            10
2                19.94            24
3                22.19            20
4                35.32            34
5                12.11             9
6                28.26            23
8                 8.07            13

             N = 137.00        N = 137


(Table 3). Pleasure, recency, and frequency re- 
tain a positive relationship with the order of 
recall. Acquaintance becomes negative. How- 
ever, none of these partial correlations was sig- 
nificant at the .05 level of confidence. 
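
A short calculation shows why partial correlations of this size fall short of significance. The sketch below assumes the usual conversion of a correlation to a t statistic, t = r·sqrt(df)/sqrt(1 − r²), with the df = 3 given in Table 3 (eight pairs of means, less two, less the three variables held constant). The critical value 3.182 is the standard two-tailed .05 point of t for 3 degrees of freedom; the function names are illustrative and do not come from the paper.

```python
from math import sqrt


def t_from_r(r, df):
    """t statistic corresponding to a (partial) correlation r with df degrees of freedom."""
    return r * sqrt(df) / sqrt(1.0 - r * r)


def r_needed(t_critical, df):
    """Smallest |r| that reaches a given critical t with df degrees of freedom."""
    return t_critical / sqrt(t_critical ** 2 + df)


DF = 3                 # from Table 3
T_CRIT_05 = 3.182      # two-tailed .05 critical t for df = 3 (standard tables)

t_pleasure = t_from_r(0.61, DF)   # about 1.33
t_recency = t_from_r(0.68, DF)    # about 1.61
r_threshold = r_needed(T_CRIT_05, DF)  # about .88

print(round(t_pleasure, 2), round(t_recency, 2), round(r_threshold, 2))
```

With only three degrees of freedom, a partial correlation would have to approach .88 before it reached the .05 level, so values of .61 and .68 remain well inside the nonsignificant range.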

With the construction hypothesis, the prob- 
lem was to determine the degree to which adja- 
cent objects in the sequence of recall were 
grouped together as similar. An object that 
was grouped as similar to neither of the adja- 
cently recalled objects would be directly 
contrary to the hypothesis. Such negative in- 
stances, which may be called “‘isolates,” facili- 
tated the testing of the hypothesis in reciprocal 
form: that the number of isolates per S is fewer 
than would be expected by chance. 
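
The definition of an isolate can be stated as a small counting routine. This is a sketch under one stated assumption: the first and last recalled names have only a single neighbor, so each counts as an isolate when that one neighbor belongs to a different group. The text does not spell out the treatment of end positions, and the function name and example labels are hypothetical.

```python
def count_isolates(groups):
    """Count names grouped with neither adjacent name in the recall sequence.

    `groups` lists the group label of each name in order of recall.
    End positions have one neighbor only (an assumption of this sketch).
    """
    isolates = 0
    last = len(groups) - 1
    for i, label in enumerate(groups):
        same_as_previous = i > 0 and groups[i - 1] == label
        same_as_next = i < last and groups[i + 1] == label
        if not (same_as_previous or same_as_next):
            isolates += 1
    return isolates


# Hypothetical S: eight names sorted into groups a, b, c, listed in recall order.
example = ["a", "a", "b", "c", "c", "c", "a", "b"]
n_isolates = count_isolates(example)
print(n_isolates)
```

Under this definition a count of exactly seven cannot occur, since a single non-isolate would need a same-group neighbor that is then also a non-isolate; this agrees with the absence of a seven-isolate row in Table 4.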

Since each of the three groups could have 
any number of items in its membership, it was 
necessary to derive the expected frequencies 
for all the possible arrangements of order and 
classification of the items. These various ar- 
rangements were in turn classified according to 
the number of isolates in each. Finally, the 
expected number of isolates per individual 
could be compared with the observed number 
of isolates per individual. (For a full account 
of this derivation, see 1, p. 42-46.) Table 4 
shows the expected and observed number of 
people having isolates at each possible fre- 
quency level. The most convenient and favor- 
able dichotomization by which to test the hy- 
pothesis with one-by-two chi square divides 
the group between the two- and three-isolate 
categories in Table 4. With this division point, 
the chi-square test shows a trend which falls
short of significance (χ² = 2.04; .20 >
p > .10).
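
The one-by-two chi square can be retraced from Table 4. The sketch below pools the categories at the stated division point (zero to two isolates versus three or more) and applies the ordinary goodness-of-fit formula without a continuity correction, which is an assumption; the variable names are introduced here for illustration.

```python
# Rows of Table 4: number of isolates, expected frequency, observed frequency.
TABLE_4 = [
    (0, 2.01, 4),
    (1, 9.10, 10),
    (2, 19.94, 24),
    (3, 22.19, 20),
    (4, 35.32, 34),
    (5, 12.11, 9),
    (6, 28.26, 23),
    (8, 8.07, 13),
]

CUT = 2  # division between the two- and three-isolate categories

expected_low = sum(e for k, e, _ in TABLE_4 if k <= CUT)
expected_high = sum(e for k, e, _ in TABLE_4 if k > CUT)
observed_low = sum(o for k, _, o in TABLE_4 if k <= CUT)
observed_high = sum(o for k, _, o in TABLE_4 if k > CUT)

# Goodness-of-fit chi square on the two pooled cells (df = 1).
chi_square = (
    (observed_low - expected_low) ** 2 / expected_low
    + (observed_high - expected_high) ** 2 / expected_high
)

print(observed_low, round(expected_low, 2), round(chi_square, 2))
```

From the printed frequencies this comes to about 2.01, close to the reported 2.04; the small difference may reflect rounding in the tabled expected values or a detail of the computation not described in the text.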


STUDY II


The mean data for the four ranked variables 
in the experiment just described are very simi- 
lar to each other (see Table 1). The possibility 
is raised, therefore, that some extraneous vari- 
able is exerting an influence common to all 
these data. For example, extraneous ranking 
habits, rather than the variables actually being 
studied, could have led to the similarly posi- 
tive findings. 

On the assumption of such a spurious rela- 
tionship it would be expected that an assumed 
neutral variable, such as height, would also be 
positive because of the extraneous influence. 
In this study the pleasure variable was com- 
pared with ranking in terms of height. It was 
predicted that with the same group of subjects 
the pleasure variable would have a significant 
relationship to the order of recall and the 
height variable would not. 

The second aspect of this study is based on 
an observation made in the first study. During 
the recall period in the first study the subjects 
were observed to write down two or three or 
four names immediately and quickly. After 
this, they paused for great lengths of time until 
all the eight names were recalled. These observations
of the recall behavior were ignored
by the experimenter until the data of the first 
experiment were analysed. It was then noted 
that differences occurred between the rankings 
of the later and earlier recalls. As can be seen 
in Table 1, the pleasure trend levels off, the 
recency and frequency trends become more
regular, and the acquaintance trend becomes
more irregular. Thus, the possibility arose that 
these changes in the trends of the ranking 
might be related to the change from the imme- 
diate, tip-of-the-tongue recall to the slower, 
harder-to-recall names. 

The study in its second aspect is therefore 
concerned with one of these rankings, the 
pleasure ranking, and its relation to the avail- 
ability of the recalled items. Since the curve for 
pleasure levels off in the functional relation- 
ship with the order of recall, it is hypothesized 
that the order of recall before the initial pause 
has a greater relationship to the pleasure rank- 
ing than has the order of subsequent recall. 


Method 


Subjects. A group of 35 girls from freshman psy- 
chology classes took part in the experiment as a part of 
course requirements. Ten of the Ss were rejected because
of failure to recall the eight names required. One
S failed to indicate the point of first pause.

Instructions. The instructions given the Ss before 
the recall period were identical to those in the first 
experiment, except that only the pleasure and height 
variables were ranked. The Ss were asked to recall and 
write down the names of eight living acquaintances 
whose first name was John. The names were put on 
individual cards with the order of recall indicated by 
numbering on the reverse side. The period for recalling 
names was approximately 15 minutes. At the end of the 
recall period, Ss were asked to indicate on their data 
sheet the number of names which they recalled im- 
mediately without having to pause. The Ss were then 
asked to shuffle the cards and then rank them in terms 
of how well they liked the individuals they had recalled. 
The ranking was recorded on the data sheet. The Ss 
were finally asked to rank the cards according to the 
height of the persons named. Again, the ranking was 
recorded on the data sheet. 


Results 


The mean ranking of pleasure correlated 
+.60 with the order of recall, significantly 
greater than zero. Between the mean ranking 
for height and the order of recall the correlation 
was +.18, not a significant degree of relation- 
ship. The correlation between pleasure-ranking 
and order of recall was not as great as that re- 
ported in the first experiment, presumably because
the N here was smaller. It was greater
than the correlation between height and order 
of recall to a probability level of .06. Thus, the 
first hypothesis was confirmed. 

The point of first pause, as reported by the 
S, was used to split the pleasure-ranking of the 
eight names into two groups. The first group 
contained the names recalled before the first 
pause. The second group contained the names 
recalled after the first pause. Because the point 
of pause varied from S to S, the sizes of the two 
groups also varied from S to S. Table 5 shows 
the distribution of the Ss in regard to the point 
of pause. 

By separating the group of eight names at 
the various points indicated in Table 5, the 
group sizes for an individual S were either one
and seven, two and six, three and five, or four 
and four. The original ranking of the eight 
names was reranked within these two smaller 
groups. Rank-difference correlations were then 
computed from each of the smaller groups. (An 
exception existed in the one case with the one- 
seven split of the eight names. Here, with only 
one set of scores, a correlation among the early 
recalls was impossible.) This procedure yielded 


TABLE 5
DISTRIBUTION OF THE POINTS OF FIRST PAUSE IN
RECALLING EIGHT NAMES

Occurrence of first pause          Frequency

Before 1st name                        0
Between 1st and 2nd name               1
Between 2nd and 3rd name               5
Between 3rd and 4th name              10
Between 4th and 5th name               4
Between 5th and 6th name               2
Between 6th and 7th name               2
Between 7th and 8th name               0

                                   N = 24


two rank correlations for each S, one describing 
the relationship between pleasure and order of 
recall before the initial pause, the other de- 
scribing the relationship between pleasure and 
the order of recall after the initial pause. 
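
The split-and-rerank procedure can be illustrated for a single S. The sketch below assumes the familiar rank-difference formula, rho = 1 − 6Σd²/(n(n² − 1)), and no tied ranks; the pleasure ranks shown are invented for illustration, and the function names are not taken from the paper.

```python
def rerank(values):
    """Convert values to ranks 1..n within the list (no ties assumed)."""
    ordered = sorted(values)
    return [ordered.index(v) + 1 for v in values]


def rank_difference_rho(ranks_a, ranks_b):
    """Rank-difference correlation; undefined (None) for fewer than two items."""
    n = len(ranks_a)
    if n < 2:
        return None
    d_squared = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1 - 6 * d_squared / (n * (n * n - 1))


def split_correlations(pleasure_ranks, n_before_pause):
    """Correlate recall order with reranked pleasure, before and after the pause.

    `pleasure_ranks` holds the pleasure rank (1 = best liked) of each name,
    listed in order of recall.
    """
    results = []
    for part in (pleasure_ranks[:n_before_pause], pleasure_ranks[n_before_pause:]):
        recall_order = list(range(1, len(part) + 1))
        results.append(rank_difference_rho(recall_order, rerank(part)))
    return tuple(results)


# Hypothetical S who paused after the third name.
hypothetical = [1, 2, 3, 6, 4, 8, 5, 7]
early_rho, late_rho = split_correlations(hypothetical, 3)
print(early_rho, round(late_rho, 2))
```

A one-seven split returns `None` for the early group, which corresponds to the single case for which no early correlation could be computed.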

A sign test was used to test the second hy- 
pothesis. For each S the chance expectancy 
that one rank correlation exceeds the other is 
.50. Among the 24 cases, 18 were in the direc- 
tion of the hypothesis, with the correlation 
based on the earlier recalls greater than that 
based on the later recalls. Four cases were in 
the direction opposite the hypothesis. One S 
had equal correlations and, therefore, was 
counted as contrary to the hypothesis. The re- 
maining case, noted in the preceding para- 
graph, was not subject to rank-correlation 
comparison because of the one-seven split. If 
the latter case is ignored, the final count would 
be 18 positive observations out of 23. If the 
latter case is included on the negative side, the 
final count would be 18 out of 24. In either in- 
stance, the sign test indicates significant results 
in the predicted direction at the .01 level of 
probability. 
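
The sign-test probabilities for the two ways of counting can be obtained from the binomial distribution with p = .50. The sketch below computes the upper-tail probability exactly; treating the test as one-tailed is an assumption, since the text does not say which form was used, and the function name is illustrative.

```python
from math import comb


def sign_test_upper_tail(successes, n):
    """P(X >= successes) for X ~ Binomial(n, 0.5)."""
    return sum(comb(n, k) for k in range(successes, n + 1)) / 2 ** n


p_18_of_23 = sign_test_upper_tail(18, 23)  # one-seven split case ignored
p_18_of_24 = sign_test_upper_tail(18, 24)  # that case counted as negative

print(round(p_18_of_23, 4), round(p_18_of_24, 4))
```

The two upper-tail probabilities come to roughly .005 and .011 respectively.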


Discussion 


The first four hypotheses in the first study 
were substantiated by the zero-order correla- 
tion to a high degree. How well the Ss liked a 
person, how much contact they had had with 
him, how recently they had seen him, and how 
well they “knew” him were all directly related 
to how promptly they recalled his name. 

However, the partial correlations between 
order of recall and pleasure, recency, frequency, 
and acquaintance, respectively, fell short of sig- 
nificance. It is to be noted here that the test of 
significance assumes in each case that eight
pairs of observations were made rather than 
137. In general, the partial correlation findings 
suggest that much of the same thing is being 
measured in each of the correlations. 

The construction hypothesis had the under- 
lying assumption that people recall names cate- 
gorically, or in groups, as a function of the way 
they conceptualize their past interactions. The 
hypothesis was tested by having the S regroup 
the names after the recall period and use spe- 
cifically three groups. This method brought a 
nonsignificant trend in the predicted direction 
(.20 > p > .10). 

In the second study the earlier recalls in a 
sequence were shown to have a greater rela- 
tionship to the pleasure ranking than the later 
recalls. This finding rules out the possibility 
of a linear relationship between the two vari- 
ables. The curve of pleasure ranking as a func- 
tion of order of recall levels off, as is suggested 
in Table 1. 

Whether the reported first pause, here used, 
is the most appropriate point at which to di- 
vide the earlier from the later recalls remains 
to be established. Presumably, the most useful 
division would be the one yielding the sharpest 
difference in relationships when comparing the 
earlier recalls with the later ones. In the present 
study it seemed reasonable to the experi- 
menter, and communicable to the Ss, to sepa- 
rate those few immediately available recalls, 
which required no pause, from the later, more 
difficult recalls. 

In more general terms, the results of these 
studies seem to have contributed to the knowledge
of serial recall in a controlled association
situation. The order in which the things are re- 
called, as a person reflects upon his past inter- 
actions in response to instructions, is to some 
extent predictable. The results also suggest 
that, for the time being, no single theory of 
serial recall need be accepted to the exclusion 
of other theories. Different theories and differ- 
ent points of view were employed equally well 
in predicting the present results. Moreover, a 
new kind of methodological approach has been
suggested. A refinement of the five variables 
studied here and an exploration of other vari- 
ables might afford a still better understanding 
of this temporal aspect of verbal behavior. 


SUMMARY AND CONCLUSIONS 


In the first experiment, 137 subjects in five 
groups were given a first name and were asked 
to recall eight living acquaintances who had 
this name. They were then asked to rank the 
recalled names as to (a) how well they like 
each person, (b) how frequently they have had
contact with him, (c) how recently they have 
had contact with him, and (d) how well they 
know him. A fifth task was to put the recalled 
names into three groups so that each group was 
alike and at the same time different from the 
other groups. These five tasks yielded rankings 
referred to as the pleasure, frequency, recency, 
acquaintance, and construction variables, re- 
spectively. The order of tasks was varied 
among the five groups. 

1. Rankings on the first four tasks were
highly correlated with the order in which the 
names were recalled. 

2. On the grouping task, order of recall fell 
short of a significant relationship with the con- 
structive groupings. 

In the second experiment, 24 subjects in one 
group were given a first name and were asked 
to recall eight living acquaintances who had 
this name. When the recalls were made, the 
subjects designated the point in the series at 
which they had to pause for the first time in 
order to recall more names. After this, they 
were asked to rank the recalled names in terms 
of how well they liked each person, and of the 
estimated heights of the individuals. 

1. The pleasure-order of recall correlation 
was significantly greater than zero, and the 
height-order of recall correlation was not. 

2. The pleasure-order of recall correlation 
was greater than the height-order of recall cor- 
relation at a probability level of .06. 

3. The order of the recalls made before the
initial pause had significantly greater relation- 
ship to the pleasure ranking than had the order 
of the later recalls. 
Blum (2) has recently obtained experimental
results that are interpreted as
lending support to the hypothesis that 
perceptual vigilance and defense phenomena 
are both consequences of repressed impulses. 
The experiment is of general theoretical in- 
terest since the obtained results represent per- 
ceptual selectivity phenomena which cannot be 
interpreted purely in terms of frequency or 
familiarity of the stimulus (4, 10, 11). On 
the contrary, the findings would seem to support
the contention that motivational states
have direct perceptual consequences. 

The experimental hypothesis was derived 
from the psychoanalytic premise that repressed 
aggressive and sexual impulses are continually 
striving for outlet in conscious behavior. On an 




unconscious or preconscious level individuals 
should therefore be responsive to environmen- 
tal cues relevant to the gratification of these 
potentially dangerous impulses (vigilance). As 
the impulses impinge on the ego, however, 
anxiety is aroused evoking the ego-defensive 
mechanisms (learned reactions to anxiety), 
thus avoiding the anxiety producing cues (defense).
“In sum we are led by psychoanalytic
theory to postulate two opposing tendencies 
vigilance and defense—and to predict the exact 
operation of each by controlling the level of 
awareness” (2, p. 94).

Seven males and seven females served as 
Ss in the Blum experiment. The Ss were 
presented sets of four miniature Blacky pic- 
tures on a card in a mirror tachistoscope. 
Two of these pictures, Oral Sadism (OS) and
Masturbation Guilt (MG), served as experimental
stimuli. The remaining two, Oral
Eroticism (OE) and Identification Process
(I), were “distractors” or control stimuli.
Level of awareness was defined in terms of the
exposure time of the stimuli; .03 seconds corresponding
to a “low level of awareness” and
.20 seconds to a higher level of awareness. A
base series, to obtain the level of responsiveness
presumably determined by the physical
properties of the stimuli, was first administered
to the Ss. This consisted of 54 trials at .03
seconds with instructions to indicate which
one of the four pictures “stands out” the
most clearly. The Ss were then shown two 
regular Blacky cards (MG and OS, stimuli 
relevant to the repressed impulse) for the 
sensitization experience. The sensitization in- 
structions were intended to structure one 
picture as the “traumatic” stimulus for each 
S (arousal of the repressed impulse) and one 
as the neutral stimulus. 

The Ss were then administered 54 additional 
exposures to the cards at .03 seconds exposure 
time, again indicating on each trial which of 
the four stimuli appeared to “stand out” 
(vigilance series). The vigilance prediction 
was tested by comparing the ratio of traumatic/neutral
choices in the “base” series
with the same ratio in the vigilance series.
Eleven Ss (six males and five females) shifted
in the direction of choosing relatively more
frequently the “traumatic” stimuli. The number
of shifts is significant at the <.05 level¹
of confidence as evaluated by the sign test.
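
The vigilance comparison can be sketched as follows. For each S the ratio of traumatic to neutral choices is computed in the base and vigilance series, a shift is scored when the ratio rises, and the number of shifts is referred to the one-tailed sign test noted in the footnote. The per-subject counts below are hypothetical, the function names are introduced for illustration, and the sketch assumes every S made at least one neutral choice in each series.

```python
from math import comb


def shifted_toward_traumatic(base, vigilance):
    """True if the traumatic/neutral choice ratio rose from base to vigilance.

    Each argument is a (traumatic_choices, neutral_choices) pair for one S.
    """
    base_ratio = base[0] / base[1]
    vigilance_ratio = vigilance[0] / vigilance[1]
    return vigilance_ratio > base_ratio


def sign_test_upper_tail(successes, n):
    """One-tailed sign test: P(X >= successes) for X ~ Binomial(n, 0.5)."""
    return sum(comb(n, k) for k in range(successes, n + 1)) / 2 ** n


# Hypothetical choice counts for three Ss: (base pair, vigilance pair).
hypothetical_subjects = [
    ((12, 15), (18, 11)),
    ((14, 13), (13, 14)),
    ((10, 16), (15, 12)),
]
n_shifted = sum(shifted_toward_traumatic(b, v) for b, v in hypothetical_subjects)

# Probability of 11 or more shifts among the 14 Ss reported in the text.
p_11_of_14 = sign_test_upper_tail(11, 14)

print(n_shifted, round(p_11_of_14, 3))
```

Eleven shifts in fourteen Ss correspond to a one-tailed probability of about .029, consistent with the reported level of <.05.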

Following the presentation of the vigilance 
series the Ss were exposed to the stimuli under 
conditions necessary for testing the defense 
hypothesis. The Ss were informed that the 
two experimental pictures would now be 
present on each of the cards; their task was 
to locate the correct position of the designated 
picture. The Ss were instructed to locate the 
“traumatic” picture for 6 trials; then the 
neutral for 6 trials until each S had been 
given 36 such trials for each picture. The ego 
was “brought into play” further by exposing 
the stimuli for longer (.20 seconds) duration 
periods (higher level of awareness). The de- 
fense hypothesis was tested by comparing 
the number of correct locations of the trau- 
matic and neutral stimuli. The results con- 
formed to the prediction that Ss would make 
more errors in locating the traumatic picture 
(12 of 14; p = .006). 

1 The level of confidence reported was obtained
using a one-tailed test.
The fact that Ss after the sensitization
experience were more responsive to the critical
stimulus presented at low exposure periods
and less accurate in locating the critical
stimulus at increased periods of exposure was
interpreted as lending support to the psychoanalytic
assumptions regarding repressed impulses
(2, p. 97). This paper presents experimental
evidence that failure to control certain
stimulus variables in the test of the defense
hypothesis precludes unequivocal interpretation
of these data as a function of a perceptual
defense mechanism. Brief consideration will
also be given to the possibility of alternative
interpretations of the data obtained under
these experimental conditions.

It is correctly stated by Blum (2, p. 95)
that in order to attribute the changes in perceptual
selectivity to the operation of vigilance
and defense mechanisms it is necessary to
determine the level of responsiveness to the
test stimuli purely as a function of the physical
properties of those stimuli. A further requirement
was that the four stimuli should elicit
approximately equal frequencies of response
in the control series (3). The pre-experimental
base series presumably served as the appropriate
control for these stimulus factors in
both the vigilance and defense series. However,
it should be noted that the two situations
present different problems of experimental
control. The test of the vigilance
hypothesis involves a comparison of ratio
scores which takes into account the individual’s
presensitization level of response to the critical
and neutral stimuli; i.e., a statistical control 
for stimulus factors is employed. The relative 
frequency of response to the four stimuli in 
the base series (3) would also be an important 
consideration for providing a range of potential 
vigilance effect and for the application of 
certain statistical tests to the obtained scores. 

The distribution of these pre-experimental 
responses, however, is most crucial to the 
interpretation of the defense effects. The test 
of this hypothesis is based on a difference score 
derived from a comparison of the frequency 
of correct location of the critical and neutral 
stimuli in independent series of trials. The 
scores could be interpreted as reflecting defense 
effects only if it has been demonstrated that 
the two stimuli have equal probabilities of 
eliciting the correct response (i.e., equal ease 
of locating) prior to the experimental sensitization
experience. A large difference score (defense)
could otherwise be purely a function of
the distinctiveness of a particular stimulus as
compared to one or more of the other three
stimuli on the card. In order to interpret the
results in terms of the defense hypothesis it is
therefore necessary to assume that relatively
equal frequency of response to the four stimuli
in the base series (.03 seconds exposure) is
adequate control for the differential ease of
locating the stimuli in the defense series
(.20 seconds). Consideration of the results
from an initial attempt to replicate the experiment
suggests this assumption is not tenable.


EXPERIMENT I 


The procedures were as similar as possible 
to the original experiment with the exception 
that the control stimuli (i.e., “distractors,” 
I and OE) were not altered. The Identification 
picture (I)² consists of two distinct forms (a
large and a small dog). If unaltered, the larger 
form is quite similar structurally to the critical 
picture for the female Ss (OS). The mean 
number of responses to the I picture was 23.79 
(base series) and 22.29 (vigilance); whereas 
the means of the other pictures for the base 
series were 13.96 (OE), 11.92 (OS), and 14.29 
(MG), and for the vigilance series, 11.21, 
13.17, and 16.67 respectively. The distinctive- 
ness of the I picture at .03 seconds exposure 
resulted in clearly asymmetrical responsive- 
ness in the base series. The obtained results 
yield statistically significant vigilance effects 
(17 of 22, 2 equal, p < .05) but nonsignificant 
defense effects (13 of 24, p > .25). These 
results could not be interpreted as contrary 
to the defense hypothesis since the require- 
ment of an even distribution of responses to 
the stimuli in the base series was not met. The 
data were examined further, however, in order 
to determine the effect of the distinctive 
stimulus on the differential ease of locating the 
critical and neutral pictures. 

Table 1 presents the distribution of mean 
frequency of response to each stimulus in the 
defense series. Condition A refers to the fact 
that all Ss were attempting to locate the 
position of the OS picture correctly, while 
under condition B, the MG picture was the 
relevant stimulus. 


2 The failure to alter the OE control picture did not
appear to affect the obtained results.
TABLE 1
MEAN FREQUENCY OF RESPONSE TO THE FOUR STIMULI
IN THE DEFENSE SERIES OF EXPERIMENT I

                 OE       MG       OS        I

Condition A*
  Females       4.00     3.33    14.08    10.58
  Males         2.33     3.34    16.58     9.75

Condition B
  (entries only partly legible: 2.00, 2.50, 1.42, 3.83, 1.50)

* Condition A refers to the fact that all Ss were attempting to
locate the OS stimulus; this was the critical picture for females and
the neutral picture for males. In Condition B MG was the stimulus
to be located; this was the neutral picture for females and the
critical picture for males.


According to the defense hypothesis females 
would correctly locate OS (in condition A) 
less often than MG (in condition B). The 
prediction for male Ss was that they would 
locate the MG (in condition B) picture cor- 
rectly relatively less often than the OS (in 
condition A) stimulus. However, the data in 
Table 1 indicate that both male (OS = neutral) 
and female (OS = critical) Ss have signifi- 
cantly fewer correct locations of the OS than 
of the MG stimulus (t = 3.06 and t = 5.33;
df = 11; p < .02). Ten females conformed to
the “defense” hypothesis as compared to only
3 male Ss.³ The primary determinant of errors
in locating the “traumatic” picture, however,
was a generalization tendency between the
OS and the similar I stimulus. Thus, the unaltered
I picture differentially affected the
difficulty of identifying the position of the 
OS and MG stimuli. The negative results in 
the test of the defense hypothesis could be 
attributed to this artifact in the experiment. 
That is, the generalization tendencies among 
the stimuli may have obscured the operation 
of the defense process. 

The question also arises, however, as to the
extent to which the “perceptual defense”
effects obtained by Blum (2) under similar
experimental conditions are a function of this
stimulus similarity variable rather than an
active repression mechanism. The only control
for stimulus properties was the essentially
equal frequency of response to the four stimuli
at .03 seconds exposure duration based on a
preliminary group of Ss. The data from this
experiment indicate the highly distinctive
stimulus (I, at .03 seconds exposure duration)
differentially interfered with correct response to the
experimental stimuli. This fact suggests that
the stimulus properties which determine
whether the stimulus “stands out” clearly at
.03 seconds exposure duration are not identical
(or equivalent) to those characteristics which
determine the ease of correct discrimination
at .20 seconds.


3 A complete analysis of Blum’s (2) results was not
available, but examination of the data available indicates
that a similar “sex difference” in the defense
series occurred in that experiment as in the first study
reported here. That is, the male Ss manifested very
small “defense” effects: mean correct response to both
OS (neutral) and MG (critical stimulus) is approximately
24.00, whereas that of the female Ss is much
larger (mean correct location of the OS, the critical
stimulus, is 11.00; that of MG (neutral picture) was
21.14).

The following experiment was conducted in 
order to test the defense hypothesis under the 
specified conditions; i.e., an even distribution 
of responses to the stimuli in the base series. 
The selection of stimuli was such that it was 
also possible to explore the alternate hy- 
pothesis; i.e., differential error in locating the 
stimuli may be primarily a function of differen- 
tial degrees of similarity between the critical 
stimulus and other stimuli in the set of four. 


EXPERIMENT II 


The I picture had proven to be perceptually
distinctive in the base series and at the same
time highly similar to the OS stimulus in the
defense series. Therefore, an enlarged picture
of Blacky from Card VII (A) was substituted
and paired with the OS stimulus to form a
critical-control pair for female Ss. The
experimental critical-control pair for male Ss was
formed by altering the OE picture until it was
adjudged to be as similar to MG as A was to
OS.⁴ The four pictures were then arranged on
each of 8 cards⁵ so that the position of each
picture was systematically rotated, every
picture appearing twice in each position.
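The counterbalancing described above can be illustrated with a short sketch. The paper does not say how the rotation was carried out, so the cyclic scheme below is an assumption, and the function name `rotated_cards` is hypothetical. It only shows that eight cards with four positions are enough for every picture to occupy every position exactly twice.

```python
from collections import Counter

# Stimulus labels from Experiment II; the order on the first card is arbitrary.
PICTURES = ["OE", "A", "MG", "OS"]


def rotated_cards(pictures, n_cards=8):
    """Build n_cards layouts by cyclically shifting the picture order (assumed scheme)."""
    k = len(pictures)
    return [
        [pictures[(pos - card) % k] for pos in range(k)]
        for card in range(n_cards)
    ]


cards = rotated_cards(PICTURES)

# Count how often each picture lands in each of the four positions.
counts = Counter(
    (picture, pos) for card in cards for pos, picture in enumerate(card)
)

for card in cards:
    print(card)
```

Any other arrangement that keeps all sixteen picture-by-position counts at two would satisfy the same constraint.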

Under these conditions the relative inability
of Ss to locate a critical stimulus as compared
to a neutral one could be attributed to a
perceptual defense mechanism. Failure to
obtain these results, however, would lend
support to the hypothesis that the “defense”
effect noted in (2), as in the group of female
Ss in Experiment I, was some function of
differential variation in the degree of stimulus
similarity among the test stimuli.

4 Since Blum had obtained fairly equal frequency
of response with MG, OS, and an altered OE stimulus,
these stimuli were retained. The selection of A to
replace I was based on the opinion of two judges that this
picture was more similar to the OS picture than to either
of the other two. These judgments of similarity are, of
course, not very precise, but were sufficient to serve
the purposes of this experiment.

5 Blum (2) used only 6 cards.


Method 


Subjects and apparatus. Ten male and ten female
Ss were randomly selected from a high school sophomore
class. The apparatus consisted of a mirror tachistoscope
with approximately .68 foot-lamberts illumination in 
each exposure field. Exposure duration in the test- 
field was controlled by a Hunter decade interval timer. 
When seated at the viewing window the S fixated a 
small dot in the center of the lighted pretest field which 
corresponded to the center of the four stimuli presented 
in the test field. 

Procedure. The procedure and instructions were as 
similar as possible to those in the original experiment 
(2) with the following exceptions: (a) The sensitization 
instructions were typed and attached to the Blacky 
cards in order to minimize the interaction between 
E and Ss. (b) The control stimuli were changed as
indicated above.

Results. The choice of the stimulus (A) to replace the 
Identification picture and the alteration of the OE
picture were such that one experimental and one control
stimulus were judged to be more similar to each other 
than to either of the other two stimuli. The purpose 
of these changes was to reduce the distinctiveness of any 
one of the 4 stimuli at .03 seconds and, at the same time, 
to equalize generalization tendencies between each 
critical stimulus and one of the control pictures at .20 
seconds. It was expected that the additional control 
of these stimulus properties would more nearly meet 
the conditions necessary for the test of the defense 
hypothesis and at the same time yield data relevant 
to the effect of stimulus similarity on the ease of dis- 
criminating the location of the stimuli in the defense 
series.

Examination of the data reveals that the change in 
stimuli did result in a more symmetrical distribution 
of responses by the groups in the base series. The 
respective means for the stimuli were: OE = 12.20; A =
12.55; MG = 16.50; and OS = 14.75. This distribution
of scores would seem to meet the basic conditions
necessary for testing the vigilance and defense
hypotheses as outlined (2, 3).

Analysis of the data yielded a significant vigilance
effect when evaluated by the t test (t = 2.08; df = 19;
.05 > p > .02), although only 12 of 17 (3 equal) Ss
conformed to the hypothesis. Only 9 Ss (of 20) conformed
to the defense hypothesis. These results are not
statistically significant at the .25 level of confidence as
evaluated by the sign test. Analysis of the data with a
more efficient statistical test also failed to yield a
significant defense effect (t = 1.60; df = 19; .20 > p >
.10).
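The sign-test evaluations mentioned above can be sketched as an exact binomial calculation. This is an illustration only: the paper does not say whether tabled or exact values, or one- or two-tailed probabilities, were used, so the one-tailed default and the function name `sign_test_p` are assumptions.

```python
from math import comb


def sign_test_p(successes, n, two_sided=False):
    """Exact sign-test probability of at least `successes` out of n untied Ss.

    Under the null hypothesis each untied S conforms with probability .5.
    """
    tail = sum(comb(n, k) for k in range(successes, n + 1)) / 2 ** n
    return min(1.0, 2 * tail) if two_sided else tail


# Counts reported for Experiment II.
vigilance_p = sign_test_p(12, 17)  # 12 of 17 untied Ss conformed
defense_p = sign_test_p(9, 20)     # 9 of 20 Ss conformed

print(f"vigilance: p = {vigilance_p:.3f}")
print(f"defense:   p = {defense_p:.3f}")
```

Under these assumptions the vigilance count gives a one-tailed probability of about .07, and the defense count gives a probability far above .25, in line with the statement that neither sign test reached significance.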


TABLE 2
MEAN FREQUENCY OF RESPONSE TO THE FOUR STIMULI
IN THE DEFENSE SERIES OF EXPERIMENT II

               OE      MG      OS      A

Condition A*
Females       4.00    4.20   16.20    7.60
Males         4.00    3.50   17.00    7

Condition B
Females       8.00   15.70    4.30    4.
Males         8.70   14.70    3.90    4.


* Condition A refers to the fact that all Ss were attempting to 
locate the OS stimulus; this was the critical picture for females and
the neutral picture for males. In Condition B MG was the stimulus 
to be located; this was the neutral picture for females and the 
critical picture for males. 


The data relevant to the effects of the stimulus 
similarity variable on the ease of correctly identifying 
the position of the critical and neutral stimuli are 
presented in Table 2. The differences between the 
mean correct response to the OS and MG stimuli in 
no case (comparing males and females separately and 
as a group) reached the .20 level of confidence. How- 
ever, statistically significant generalization effects were 
obtained within each critical-control pair. The mean 
for stimulus A, under condition A, is significantly 
larger than OE or MG (.05 > p > .02) while the mean 
of OE, under condition B, is significantly larger than 
OS or A (.05 > p > .02). These results demonstrate 
that the degree of similarity among the four stimuli is a 
significant determinant of the Ss’ ability to identify 
correctly any one stimulus picture. 


Discussion 


The methodological implication of these
results is quite clear. Variation in similarity
among the four stimuli is an important
determiner of the frequency of correct response in
the defense series. Control of this variable is
essential before any obtained differential
response tendencies can be attributed to a
perceptual defense mechanism. These results
lend strong support to the hypothesis that a
pre-experimental series of exposures at .03
seconds duration, even under conditions where
the stimuli elicit essentially equal frequency of
response, does not serve as an adequate control
for the relevant stimulus properties. The fact
that appropriate control for stimulus properties
was not present in the original experiment (2)
obviates the interpretation of these
data in terms of a perceptual defense or
repression mechanism.⁶ Whether or not systematic
errors of localization beyond those
attributable to stimulus similarity are obtainable
under these experimental conditions is a
problem for future research.

6 Additional evidence supporting the repressed
impulse interpretation of “perceptual defense” has
been recently reported (1). The similarity between the
criterion for repression as a characteristic defense
mechanism and the method of testing the perceptual
consequences of repression, however, makes
interpretation of the data equivocal.

There is some question, however, whether it
would be necessary to postulate an active
avoidance mechanism (repressed impulse and/or
defense) in order to account for such results.
Since the critical stimulus had been associated
with anxiety, an alternative explanation in
terms of the effects of anxiety on the
generalization gradient (9) seems to be an equally
plausible interpretation. Data from the
replication experiment lend some tentative support
to this hypothesis. The Ss who became sensitized
to the critical stimulus (vigilance) tended
to manifest a stronger generalization tendency
between the critical-control pair of stimuli as
compared to the neutral-control pair. Further
experimentation is under way explicitly to
test this hypothesis.

The data from the two experiments reported
here do offer support for the vigilance hypothesis.
In the first experiment, 17 (of 22, 2 equal)
Ss were more responsive to the critical stimulus
in the vigilance than in the base series
(p = .05). The results of the second experiment
were not statistically significant according
to the sign test; but a t test yielded a ratio
of 2.08, which for 19 degrees of freedom is
significant between the .05 and .02 level of
confidence. The combined probability of both
experiments is significant beyond the .05 level
of confidence (χ² = 10.84; df = 4). These
results are consistent with other experimental
evidence indicating that association of anxiety
with previously neutral stimuli (7, 8) or
anxiety arousal under certain experimental
conditions (6, 12, 13) is related to an increased
perceptual sensitivity to anxiety cues.

It should be pointed out, however, that the
increased responsiveness to the critical stimulus
does not necessarily involve any “gratification
of repressed impulses.” For example, it
has been demonstrated that perceptual
sensitization tends to occur if identification of the
anxiety cue prevents further anxiety arousal
(8) and under experimental conditions which
tend to “accentuate” the anxiety-producing
stimulus (7). The anxiety-producing instructions
accompanying the critical stimulus, as
compared to the neutralizing instructions
associated with the other stimulus, may have
served to produce such accentuation. The
increased responsiveness could be attributed
to the effects of differential learning resulting
from the experimental sensitization experience,
i.e., to the differential familiarity of the critical
stimulus (4, 10, 11).
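The combined probability cited in the Discussion (χ² = 10.84; df = 4) can be checked with a small sketch. The paper does not name the combining procedure; a chi-square with 4 degrees of freedom for two experiments is consistent with Fisher's method (minus twice the sum of the log probabilities, with 2k degrees of freedom), and that is assumed here. The function names are hypothetical.

```python
from math import exp, factorial, log


def fisher_chi_square(p_values):
    """Fisher's combining statistic: -2 * sum(ln p), referred to chi-square with 2k df."""
    return -2.0 * sum(log(p) for p in p_values)


def chi_square_sf_even_df(x, df):
    """Upper-tail chi-square probability; this closed form holds for even df only."""
    if df <= 0 or df % 2:
        raise ValueError("closed form requires a positive even df")
    half = x / 2.0
    return exp(-half) * sum(half ** i / factorial(i) for i in range(df // 2))


# Probability associated with the reported statistic.
combined_p = chi_square_sf_even_df(10.84, 4)
print(f"chi-square = 10.84, df = 4, p = {combined_p:.4f}")
```

The reported statistic corresponds to a probability of roughly .03, which agrees with the statement that the combined result is significant beyond the .05 level.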


SUMMARY 


The results of an initial attempt to replicate 
the results of Blum’s experiment (2) indicated 
that stimulus similarity was an important 
determinant of errors of localization in the 
test of the perceptual defense hypothesis. A 
second experiment was designed to test the 
defense hypothesis under the specified condi- 
tions and at the same time yield information 
concerning the role of stimulus similarity on 
errors of recognition in the defense series. 
The results indicated similarity among the 
experimental and control stimuli was the 
primary determinant of the frequency of cor- 
rect response. Therefore, it was concluded 
that relatively equal frequency of response to 
these stimuli at .03 seconds exposure duration 
is not an adequate control for the effects of 
stimulus properties on errors of localization at 
.20 seconds exposure duration. Further, since 
appropriate control of stimulus factors was not 
present in the original experiment (2), serious 
doubt arises concerning the perceptual defense 
interpretation of results obtained in that 
experiment. Evidence was presented which 
tentatively suggests that systematic errors of 
localization might be due to an increased 
generalization gradient associated with anxiety 
arousal. 

The data from both experiments reported 
here confirmed the findings with respect to 
the increased perceptual sensitivity to stimuli 
associated with anxiety (perceptual vigilance). 
It was pointed out, however, that there is no
necessary reason to attribute these results to
repressed impulses. An alternative explana- 
tion in terms of possible differential learning 
under conditions of anxiety arousal was 
suggested.


THE problem investigated in the present
study is the relation between the acceptance
of a role of authority and the acceptance
of a role of submission to authority.
It is hypothesized that a tendency to use freely 
the authority of the former role is associated 
with a tendency to submit freely to the au- 
thority encountered in the latter role. Sub- 
stantiation of this hypothesis would indicate 
the existence of a personality variable which 
may be termed admiration of authority. Persons 
possessing a high degree of admiration for 
authority, by the present hypothesis, seek out 
positions of authority and freely exercise their 
authority over subordinates, but at the same 
time submit readily to authority in the hands 
of others. The pathological analogue of this 
type of personality structure has been per- 
ceptively analyzed by Fromm (3) in terms of 
sadism and masochism, while Adorno et al. (1) 
in their study of prejudice discuss six person- 
ality syndromes revealed by questionnaire and 
interview data, one of which, the authoritarian 
type, has as one of its defining characteristics 
an identification with strength, i.e., an admira- 
tion of authority. 

The role of parent was taken to represent the 
role of authority and that of citizen to represent 
that of submission to authority. The hypothe- 
sis of primary interest was then formulated as: 

Hypothesis 1. Individuals who demand (or
who would demand) strict obedience from their 
children tend to accept demands for strict obedi- 
ence made on them as citizens. 

Two groups of items were constructed to test 
this primary hypothesis. The first of these was 
the POL group (Political—acceptance of 
authoritarian political policies), while the 
second item group was the DFO (Demands for 
Obedience) group. Two other item groups 
were constructed in order to explore two other 
areas judged to be relevant. The UPB (upbringing)
group was constructed from items
concerning relatively factual aspects of the
individual’s upbringing.¹ By noting the correlation
between POL and UPB, the following
hypothesis is tested:

Hypothesis 2. Individuals who, as children,
accept demands for strict obedience from their
parents tend to accept demands for strict
obedience made upon them as citizens.

Then by means of the correlation between
DFO and UPB, a third hypothesis may be
tested:

Hypothesis 3. Individuals who, as children,
accept strict demands for obedience from parents
tend to demand strict obedience from their
children.

These three hypotheses are central in the
present study. However, as background data
on respondents were also available, it was
possible to test additional hypotheses regarding
differences in subgroup means. The five
variables used for this purpose were sex, age,
occupational status, father’s occupational status,
and educational level.

1 A fourth item-group was constructed, which was
intended to tap the respondent’s feelings of
rebelliousness toward his parents and his home
environment, but it made no contribution to the
interpretation of the results, as a chi square test
indicated that it did not discriminate between subgroups
of the sample. This lack of discriminative power can
probably be attributed to the relatively strong wording
of the items, which made them unacceptable to all but a
small proportion of the respondents, and resulted in a
highly skewed distribution of responses.


METHOD


The present study is an analysis of part of the data
collected in a more comprehensive survey of political
attitudes, the Uppsala Survey,² conducted by the
Sociology Institute of the University of Uppsala,
Sweden, during the winter of 1951-52. The city of
Uppsala has a population of about 60,000. As well as
possessing the country’s oldest university, which has
an enrollment of approximately 4,000, it is situated in
an agriculturally fertile region, and also has a certain
amount of industry.

A simple random sample of adults (18 and over)
was obtained, using the files of the city registrar’s
office. Approximately 220 individuals were selected and
from among these 194 were reached and interviewed.
Thirty-four interviewers were used, all of whom were
university students studying sociology. Of these, 32
interviewed 6 persons each and the remaining 2
interviewers contacted 1 person each. Each interviewer had
received training in interviewing techniques and had
passed an achievement test in this area. Names of
persons to be interviewed were assigned at random.

2 Under the direction of Hans L. Zetterberg.

Of the 194 persons interviewed, 144, or 74 per cent,
were used in the study. There were 73 men and 71
women. The remainder represent cases in which certain
information was missing from the interview schedule.
In almost all cases the missing data were responses to
some of the items in the three item-groups POL,
DFO, and UPB. In a very few cases the missing data
involved background information on the interviewee
(sex, age, etc.). When more than two items in any
item-group were unanswered, the interview schedule was not
included in the tabulation. For those interview schedules
in which not more than two items were unanswered in
any item-group, an interpolative scoring procedure
was used, the unanswered items being assigned
intermediate scores. For the most part, missing responses
probably represent inability of the interviewee to
make up his mind about an item. Interviewers were
instructed to use some persuasion in eliciting responses,
but not to insist. A few interviewees may have been
unwilling to commit themselves on certain topics even
though they were in all cases informed that their
answers were to be used only for research purposes.



The items were initially worded in English by the
author and then translated into Swedish by a Swedish
sociologist who rephrased items, when appropriate,
to fit the national and local verbal habits. The items
were then presented for comments to the members of
the methodology class from which the interviewers
were drawn and some items were reworded on the basis
of the suggestions received. All items were then
pretested on a small group of adults. The main change
resulting from this pretest was to reduce the number of
choices of answers to some questions from five to three
so that all items were either dichotomous or
trichotomous in their final form.

The POL group contains four items concerning
freedom of political expression, two about national
honor, two more dealing with unquestioning obedience
to the state, one about punishment, and a remaining
item pertaining to restriction of group activities. The
major subgroup within DFO consists of seven items
about punishment of children. Another subgroup of
four items deals with obedience of children to decisions
of parents, while the remaining two items maintain
that parents know best. The items in UPB all inquire
about relatively factual matters of the respondent’s
upbringing. Six items relate to freedom of activities as a
child, while six others inquire about the respondent’s
voice as a child in child-parent relations. Thus, POL
contains ten items, DFO thirteen, and UPB twelve.
High scores signify a totalitarian attitude (POL),
authoritarian child-rearing attitudes (DFO), or an
authoritarian upbringing (UPB).³


RESULTS AND DISCUSSION 


The means, standard deviations, and test-retest
reliabilities of the three item-groups are
shown in Table 1. Also shown is the highest
possible score for each group. All groups
exhibit means well removed from either end of the
range of possible scores and the standard
deviations indicate satisfactory spread of scores. To
obtain an estimate of the item-group
reliabilities, 36 persons were reinterviewed at a
later date with the same interview schedule.
The obtained reliability coefficients are quite
satisfactory for short instruments designed to
detect group effects only.

3 The items are not given here, but have been
deposited with the American Documentation Institute.
Order Document No. 4838 from ADI Auxiliary
Publications Project, Photoduplication Service, Library of
Congress, Washington 25, D. C., remitting in advance
$1.25 for microfilm or $1.25 for photocopies. Make
checks payable to Chief, Photoduplication Service,
Library of Congress.


TABLE 1
SCORE DISTRIBUTIONS AND RELIABILITY OF ITEM GROUPS

Item-group   Highest Possible Score   Mean   SD   Test-retest Reliability

[Entries only partly legible in the source. Surviving
fragments, in source order: "3.91 73"; "POL 2 11.62";
"DFO 3 16.72"; "UPB 8.76".]


TABLE 2
ITEM-GROUP INTERCORRELATIONS

                    Correlation Coefficient
Item-groups    Uncorrected    Corrected for Attenuation
POL-DFO            .33                .43
POL-UPB            .22            [illegible]
DFO-UPB            .21            [illegible]

[Table entries are illegible in the source; the values
shown are those stated in the text.]
* p < .01.


Table 2 shows the correlations between 
item-groups, both as computed directly and as 
corrected for attenuation. The correlation of 
primary interest is that between POL and 
DFO, as it is this which indicates the validity 
of Hypothesis 1. It is the largest of the correla- 
tions, being .33 uncorrected and .43 corrected 
for attenuation. A two-tail z-transformation 
test of significance shows this correlation to be 
significant beyond the .01 level of confidence. 
A correlation ratio was also computed from a 9 
by 10 scatter diagram which, when corrected 
for discontinuity, was equal to .36. A chi- 
square test of the significance of curvilinearity 
(4) showed the p value to be greater than .40 
and therefore the hypothesis of linearity of 
regression is not rejected. 
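The two computations named in this paragraph, the correction for attenuation and the z-transformation test, can be sketched as follows. The formulas are the standard ones and are assumed rather than quoted from the paper; the reliabilities passed to `correct_for_attenuation` in the example are placeholders, not the study's values, and both function names are hypothetical.

```python
from math import atanh, erfc, sqrt


def correct_for_attenuation(r_xy, rel_x, rel_y):
    """Estimate the correlation between true scores from the two reliabilities."""
    return r_xy / sqrt(rel_x * rel_y)


def fisher_z_test(r, n):
    """Two-tailed test of a zero population correlation via Fisher's z.

    atanh(r) is approximately normal with standard error 1 / sqrt(n - 3).
    """
    z = atanh(r) * sqrt(n - 3)
    p = erfc(abs(z) / sqrt(2))  # two-tailed normal probability
    return z, p


# POL-DFO correlation reported in the text, with the 144 usable respondents.
z, p = fisher_z_test(0.33, 144)
print(f"z = {z:.2f}, two-tailed p = {p:.5f}")

# Placeholder reliabilities of .77 for both item-groups (hypothetical).
example_corrected = correct_for_attenuation(0.33, 0.77, 0.77)
print(f"corrected r with placeholder reliabilities = {example_corrected:.2f}")
```

With r = .33 and N = 144 the normal deviate is a little above 4, comfortably beyond the two-tailed .01 criterion stated in the text.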

POL-DFO correlations within subgroups
(sex, age, occupational status, father’s
occupational status, and educational level) ranged
from .15 to .72, but two-tail z-transformation
tests of significance indicated that no differences
between subgroup correlations were significant
at the .05 level.

Partialling out the effects of UPB yields a 
POL-DFO correlation of .30, indicating that 
upbringing as measured by UPB has little con- 
nection with political views outside that 
resulting from its communality with DFO. 
This finding is verified by the slight improve- 
ment in the prediction of POL scores resulting 
from using both DFO and UPB scores, the 
multiple r being .37. 

Turning to Hypothesis 2, it is supported by 
an uncorrected r of .22 between POL and 
UPB, significant at the .01 level. Hypothesis 3 
is also accepted on the basis of the DFO-UPB 
correlation of .21, also significant at the .01 
level. Thus all three of the major hypotheses are 
accepted at the .01 level of confidence. 

Two additional partial correlations were 
computed. The first shows that the association 
between POL and UPB is lowered from .22 to 
.17 when DFO is partialled out. Likewise, the 
association between DFO and UPB is lowered 
from .21 to .14 by partialling out the effects of 
POL. This shrinkage of correlations is inter- 
preted as signifying that Hypotheses 2 and 3 
are verified in part as a consequence of the 
POL-DFO correlation rather than as entirely 
independent relationships. 
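The partial and multiple correlations discussed above can be approximately reproduced from the three rounded coefficients given in the text (.33, .22, .21). The sketch uses the standard first-order partial correlation and two-predictor multiple correlation formulas; that these are the formulas the author applied is an assumption, and the function names are hypothetical.

```python
from math import sqrt


def partial_r(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant."""
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


def multiple_r(r_y1, r_y2, r_12):
    """Multiple correlation of y with two predictors whose intercorrelation is r_12."""
    r_squared = (r_y1 ** 2 + r_y2 ** 2 - 2 * r_y1 * r_y2 * r_12) / (1 - r_12 ** 2)
    return sqrt(r_squared)


# Uncorrected correlations as reported in the text.
R_POL_DFO, R_POL_UPB, R_DFO_UPB = 0.33, 0.22, 0.21

pol_dfo_given_upb = partial_r(R_POL_DFO, R_POL_UPB, R_DFO_UPB)
pol_upb_given_dfo = partial_r(R_POL_UPB, R_POL_DFO, R_DFO_UPB)
dfo_upb_given_pol = partial_r(R_DFO_UPB, R_POL_DFO, R_POL_UPB)
pol_from_both = multiple_r(R_POL_DFO, R_POL_UPB, R_DFO_UPB)

print(f"POL-DFO with UPB held constant: {pol_dfo_given_upb:.2f}")
print(f"POL-UPB with DFO held constant: {pol_upb_given_dfo:.2f}")
print(f"DFO-UPB with POL held constant: {dfo_upb_given_pol:.2f}")
print(f"multiple R of POL on DFO and UPB: {pol_from_both:.2f}")
```

Starting from the rounded inputs, each result falls within about .01 of the value reported in the text (.30, .17, .14, and .37); the small gaps presumably reflect rounding of the published coefficients.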

Although all three intercorrelations between
item-groups are significant, it is important to
note that the magnitudes of the r’s are such
that only a small fraction of the total variance
of scores on one item-group is associated with
the variance of the scores on either of the other
two item-groups. Would an American sample
have exhibited higher intercorrelations? Most
likely not. More recently two groups of items
which were quite similar to, and in some cases
identical with, the items in POL and DFO
were administered to a group of 100
undergraduates at the University of Wisconsin.
Interestingly enough, the correlation obtained
was .33, exactly that found between POL and
DFO. It seems reasonable to assume that an
American sample would yield about the same
magnitude of correlations between POL and
UPB, and DFO and UPB as did the Swedish
sample.


TABLE 3
LEVELS OF SIGNIFICANCE FOR DIFFERENCES IN
SUBGROUP MEANS

                                     Item-group
                                 POL     DFO     UPB
Sex                              .01      -
Age                              .01     .05     .01
Occupational status               ?      .05*     ?
Father’s occupational status     .01     .05*
Educational attainment           .001    .05     .05

* Nonmonotone means.
[? = entry illegible in the source; the column
alignment of partly filled rows is uncertain.]


For any sample, it seems likely that the 
intercorrelations would have been higher if the 
roles involved had been perceived by the
respondents as belonging to a single hierarchy 
of authority. For example, every Army officer 
must take both the role of a superior (toward 
his subordinates) and of a subordinate (toward 
his superiors), and the two roles are clearly 
perceived as fitting into a single chain of 
authority. Under these conditions, it would be 
expected that free use of authority, i.e., strict 
demands upon subordinates, would be more 
closely associated with ready compliance to 
authority from above than in the case of the 
roles investigated in the present study. 

Some interesting differences in subgroup 
means were obtained when the sample was 
divided according to certain antecedent factors 
(Table 3). There were three categories for each 
factor other than sex. By far the most highly 
significant difference in means is that between 
levels of educational attainment on POL. The 
F ratio required to reach the .01 level of signifi- 
cance, with 2 and 141 degrees of freedom, is 
4.76 while that obtained was 21.63. The more 
school grades completed, the less political 
authoritarianism was shown. The more edu- 
cated groups also had significantly lower scores 
on DFO and UPB. Age too was influential, the 
older groups being significantly higher on all 
three item groups. A peculiarity of the occupa- 
tional status factor was that the means were 
nonmonotone, the middle group being the low- 
est on POL and DFO, but highest on UPB. In 
contrast, those whose father’s occupational 
status was relatively high were the lowest on 
POL and DFO, while the highest UPB mean 
was for the intermediate level of the father’s 
occupational status. The only significant sex 
difference was that men were lower on POL 
than were women. 

It is of importance that these antecedent
factors affect the item-group means appreciably,
but have no significant effect on the
correlation between POL and DFO, as mentioned
earlier. This finding supports the view of a
dynamic relationship between roles of domi- 
nance and those of submission, such as is taken 
by Fromm in his analysis, but does not disclose 
the specific nature of the relationship. 


SUMMARY 


Three groups of questionnaire items were 
constructed to measure authoritarian political 
attitudes, authoritarian child-rearing attitudes, 
and authoritarian aspects of upbringing. One 
hundred forty-four adults from a random 
sample in a Swedish town were interviewed. It 
was found that individuals who made strict 
demands as parents expected in turn that 
strict demands be made on them as citizens.
Significant positive correlations were also
found between authoritarianism of upbringing
and authoritarianism of both political and 
child-rearing attitudes. These findings verified 
predictions which had been derived from three 
hypotheses concerning the nature of the rela- 
tionship between roles of authority and roles of 
submission.


THE NEGRO STEREOTYPE AND PERCEPTUAL ACCENTUATION¹

Emory University


THE tendency on the part of individuals
in our culture to stereotype the Negro
by attributing to him certain unfavorable
personality characteristics has been
studied by social psychologists for several 
decades. Viewing this useful and familiar con- 
cept of the stereotype from the perspective of 
contemporary work in social perception, how- 
ever, raises several questions that have not 
been directly investigated. 

First, is the impression of personality traits 
obtained from a face controlled by such cate- 
gorical responses as, “This is a white man,” or 
“This is a Negro”? Are light Negroes and dark 
Negroes stereotyped to the same degree? Or, 
is there, instead, a quantitative relationship 
between the degree to which a Negro possesses 
“Negroid” physical characteristics and the 
extent to which stereotyped traits are at- 
tributed to him? Such a categorical response— 
assigning a set of traits to an individual be- 
cause of his membership—has generally been 
a part of the definition of a stereotype, al- 
though empirical data have seldom been 
presented to support this view in any direct 
fashion. 

Secondly, do anti-Negro individuals exag- 
gerate stereotyped personality characteristics 
in rating Negro photographs, and do pro- 
Negro individuals de-emphasize these char- 
acteristics, as compared with neutral judges? 
An affirmative answer is anticipated. Prej- 
udiced persons may be expected to rate Negro 
faces lower on favorable traits and higher on 
unfavorable traits; the reverse is likely to be 
true for pro-Negro persons. 

Finally, do anti-Negro and pro-Negro indi- 
viduals perceive the Negro as “‘more Negroid” 
in physical traits than an unprejudiced judge? 
This question involves an extension of Bruner 
and Postman’s (7) concept of accentuation, 
which has generally been described in opera- 
tional terms as an increase in perceived and 
remembered size occurring when stimuli are 
valued or need-related. In the present paper
the concept is extended beyond mere size to
ratings of physiognomic variables used to
characterize Negroidness as compared with
Caucasian-like qualities. Although such
variables, unlike “size,” cannot be as readily
expressed in physical terms, they are
dimensions which might be expected to reveal
accentuation, because of their close relation
to a personal variable such as prejudice.

1 This is the eighth in a series of papers entitled,
“Personalities in Faces.”

2 Now at the University of Nevada.

3 Now at Washington University.


METHOD 


Photographs. Fifteen Negro pictures, ranging from 
very light to very dark, were chosen from yearbooks 
of two Negro colleges in Atlanta, Georgia. These 
pictures were rated by a preliminary group of student 
judges on 15 physiognomic traits believed to be char- 
acteristic of Negroes. Ten photographs were retained 
for further use. Five white photographs were also 
chosen from yearbooks of two Southern universities. 
In addition to the attempt to obtain photographs which 
roughly ranged along a continuum of Negroidness, 
several other criteria were used for selecting pictures. 
All were full face, black and white photographs,
approximately the same size (about 2” × 2”), of young
adult males. None had distinctive or unusual ap- 
pearances; e.g., photographs were omitted if the indi- 
vidual wore glasses, had a mustache, or unusual facial 
marks. Pictures of individuals who were definitely 
smiling, frowning, or exhibiting other strong signs of 
emotion were not included. Portions of the photographs 
showing clothing were removed. 

Subjects. Initially, 47 students, 15-18 years of age, 
were obtained from high schools in Atlanta, Georgia. 
A Likert-type scale of prejudice to be described later 
was administered to them. Upon tabulating their 
prejudice scores, very few of them were found to have 
scores at the end of the scale involving “favorableness 
toward the Negro;” consequently the group was 
augmented by additional subjects from a less prej- 
udiced population. 

An attempt was made to obtain these additional 
students from school systems in New York and New 
Jersey; however, school administrators refused per- 
mission to conduct the study because it involved Negro 
photographs. The extra subjects finally chosen were 
from a B’nai B’rith group in Atlanta. This is a Jewish 
organization which stresses tolerance and brotherhood, 
and thus is not strictly matched with the “anti-Negro” 
and the “neutral” groups of judges. Five girls and six 
boys from this group, 15-17 years of age, volunteered to 
participate. 

Rating scales. Two rating scales were developed for 
this experiment. In order to insure that trait-names 
would be expressed in language familiar to the rater, 16 
volunteer students were asked to list the physical and 
personality traits that they, or people in general, usually
associate with the Negro. The final choice of the 10 
physiognomic traits used in the rating scale took into
account these preliminary descriptions as well as
physical characteristics of Negroes described in several
anthropological references (3, 13, 15). Physiognomic
traits were rated on a seven-point numerical scale, the
ends of the scale being designated by appropriate
adjectives. For example, the trait width of nose was
labeled wide at the 1 end and thin at the 7 end.

The 7-point personality rating scale consisted of 15
traits chosen from the preliminary descriptions and
from the traits given the Negro by the subjects in
Gilbert’s repetition (9) of the early Katz and Braly
study (11) of racial stereotypes. Certain traits believed
to be obviously unsuitable for rating from a facial
photograph, e.g., “ostentatious,” were not included in
the scale. Personality traits were not given adjectival
labels at the ends of the scale, but were labeled most
characteristic at either the 1 end or 7 end, and least
characteristic at the opposite end. After the rating
procedure, judges in the B’nai B’rith group were shown
the pictures again and asked to identify which were
Negro and which were White. This precaution was not
exercised with the earlier groups.

Attitude scale. No existing scale of attitude toward the
Negro appeared to be entirely satisfactory.
Two common faults were that some items were dated
by their content, and others were too extreme to
discriminate adequately varying degrees of prejudice.
Consequently a composite scale of 18 items was compiled
from the attitude scales in Adorno, et al. (1),
Likert (14), and Hinckley (10).* These were
administered and scored by utilizing the Likert method
of five response alternatives for each item.


RESULTS 


Obtained ratings on the physiognomic scale
were transformed directionally so that in each
case a low rating implied more Negroidness,
and a high rating more Caucasian-like appearance.
Ratings by each judge on the 10 physiognomic
traits were then added to obtain a total
Negroidness score for each photograph rated
by him. As a test of the appropriateness of this
procedure, each of the 10 items was correlated
with the total Negroidness score. As may be
seen from Table 1, eight of them correlated
very highly with the total score. Indeed, the
correlation of .97 between dark-light complexion
and Negroidness suggests that the
latter can be rated very accurately on the
basis of a single-item scale. In data presented
in the remainder of the paper, however,
physiognomic Negroidness is based on a total
score for the eight items, omitting cheekbones
and curve of mouth. This score is
abbreviated Neg.

Next, the ratings for the 15 personality
attributes were transformed directionally so that
a low rating was in the direction of the Negro
stereotype. On every trait, the mean rating
for the white photographs was compared with
the mean for the Negro photographs, using t
tests. The prediction that Negroes would be
rated lower than whites on the following
traits, alert expression, honest face, air of
responsibility, air of refinement, intelligent look,
and thrifty, was confirmed in all instances
(p < .05). Again in accordance with expectations,
Negroes were rated higher than whites
on the following: superstitious, lazy, emotional,
untidy, and immoral. Four other traits
on which the Negro was expected to be rated
higher were: happy-go-lucky and pleasure-loving,
on which there was no significant
difference (p > .05), and warmhearted and
religious, which were rated in the opposite
direction. The 11 traits rated in the predicted
direction (p < .05) were combined to obtain
a total score for personality stereotyping
(abbreviated Ster).

* The writers are indebted to Dr. Hinckley for
furnishing his scale items in a personal communication.


Our first hypothesis states that there is no 
difference in the degree to which Negro photo- 
graphs varying in physiognomic Negroidness 
(Neg) yield stereotyped personality impres- 
sions. That is, even if he has a Caucasian-like 
appearance, a Negro will be seen as having, in 
full degree, all the stereotyped traits usually 
attributed to the Negro. In order to test this 
hypothesis, the mean Neg score was obtained 
for all judges on each photograph. The mean 
Ster score was also obtained for each picture. 
These data are presented in Fig. 1. 
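
The scoring described above can be illustrated with a short sketch. It is not the authors' procedure in detail: the trait names, the choice of which traits are reverse-keyed, and the toy ratings are all invented for illustration. Each seven-point rating is first aligned so that a low value always points in the same direction, the retained traits are summed for each judge, and the sums are then averaged over judges for each photograph, in the manner of the mean Neg and Ster scores.

```python
from statistics import mean

SCALE_MAX = 7  # seven-point rating scale, as described in the Method section


def align(rating, reverse_keyed):
    """Flip a 1..7 rating when the trait is keyed in the opposite direction."""
    return SCALE_MAX + 1 - rating if reverse_keyed else rating


def judge_total(trait_ratings, retained, reversed_traits):
    """Sum one judge's aligned ratings of one photograph over the retained traits."""
    return sum(align(trait_ratings[t], t in reversed_traits) for t in retained)


def photo_means(ratings, retained, reversed_traits):
    """ratings: {judge: {photo: {trait: rating}}} -> {photo: mean total over judges}."""
    photos = sorted({p for per_judge in ratings.values() for p in per_judge})
    return {
        p: mean(judge_total(per_judge[p], retained, reversed_traits)
                for per_judge in ratings.values())
        for p in photos
    }


# Invented toy data: two judges, two photographs, two hypothetical traits.
toy = {
    "judge_a": {1: {"complexion": 2, "hair": 6}, 2: {"complexion": 6, "hair": 2}},
    "judge_b": {1: {"complexion": 1, "hair": 7}, 2: {"complexion": 7, "hair": 1}},
}
means = photo_means(toy, retained=["complexion", "hair"], reversed_traits={"hair"})
```

Plotting such per-photograph means for the two scores against photograph order would give curves of the kind the text describes for Fig. 1.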

From the curve for the mean Neg scores, it
may be seen that the 10 Negro photographs
and the five white pictures become increasingly
Caucasian, from left to right in the figure,
and that the Negro photographs are continuous
with the white, on this variable. The Ster score
curve, on the other hand, does not at all follow
the physiognomic one, but instead, has
essentially a horizontal slope. Thus, personality
stereotyping remains unchanged with
increasing Caucasian appearance. In fact, data
for pictures 9 and 10 indicate that the two
whitest Negroes are as heavily stereotyped as
the darkest Negroes, in spite of the probability
that, if the photograph of the lightest Negro
had been shown in a context of white
photographs only, he most likely would not have
been recognized as a Negro.


TABLE 1
CORRELATIONS BETWEEN EACH PHYSIOGNOMIC TRAIT
AND THE TOTAL PHYSIOGNOMIC RATING FOR
EACH PICTURE

Physiognomic Traits

Complexion (dark)
Hair (coarse)
Fullness of lips (thick lips)
Skin texture (oily)
Nose (flat)
Nose (wide)
Hair (curly)
Ears (small)
Curve of mouth (corners turned down)
Cheek-bones (prominent)

* r = .64 at the .01 level, with N = 15.

The conclusion to be drawn from Fig. 1, 
then, is that, in the eyes of our high school 
subjects: “A ‘Nigger’ is a ‘Nigger,’ no matter 
how white he may be.” It should be noted 
that, according to their own statements, the 
subjects had no difficulty in identifying nine 
of the Negro photographs as Negroes, and 
only slight difficulty was reported in the case 
of the lightest Negro. Apparently, identifica- 
tion of the photograph as Negro is sufficient 
to yield the full Negro stereotype, regardless 
of how Negroid or how Caucasian the indi- 
vidual looks. Because of the possibility that 
prejudiced judges might show this effect to a 
greater extent than neutral ones, the 58 judges 
were divided into groups on the basis of their 
prejudice scores, as follows: “anti-Negro,”
1.0-2.4; “neutral,” 2.5-3.4; and “pro-Negro,”
3.5-5.0. Separate curves for the three groups 
of judges were found to be very similar to 
Fig. 1, indicating that all three groups respond 
categorically to Negro photographs. 
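
The grouping rule quoted above amounts to a simple classification by score range. The sketch below is a minimal illustration: the cut points and group labels are the ones given in the text, while the function names and the assumption that each judge's score is a mean item score reported to one decimal place are introduced here and are not confirmed by the paper.

```python
from collections import Counter

# Upper bounds of the three score ranges quoted in the text (labels are the paper's).
GROUP_BOUNDS = (
    (2.4, "anti-Negro"),
    (3.4, "neutral"),
    (5.0, "pro-Negro"),
)


def prejudice_group(score):
    """Assign a judge to a group from a Likert score on the 1-5 scale."""
    if not 1.0 <= score <= 5.0:
        raise ValueError("score must lie between 1.0 and 5.0")
    rounded = round(score, 1)  # assumption: scores are reported to one decimal
    for upper, label in GROUP_BOUNDS:
        if rounded <= upper:
            return label
    raise AssertionError("unreachable: 5.0 is the largest upper bound")


def group_sizes(scores):
    """Count how many judges fall in each group."""
    return Counter(prejudice_group(s) for s in scores)


sizes = group_sizes([1.2, 2.4, 3.0, 4.1])  # invented scores for illustration
```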

Our second hypothesis was that anti-Negro 
judges would exaggerate the personality 
stereotype of the Negroes, whereas the pro- 
Negro group would de-emphasize it. To test 
this hypothesis, the mean Ster score on the 10 
Negro photographs was determined for each 
group of judges. The location of these mean 
scores is in accord with the hypothesis, as 
Fig. 2 shows. Thus, the anti-Negro group 
intensifies the stereotyped characteristics of 
the Negroes, as compared with the neutral 
group. The difference between the neutral 
group and the pro-Negro group, however, is 
not sufficiently large to reach significance at 
the .05 level. 

Another way of testing the hypothesis is to 
correlate the Likert prejudice score for each 
judge with his Ster score for the Negro photo- 
graphs. An r of .31 (df = 56, p = .01) was ob- 
tained. Although this finding supports the 
hypothesis, its low magnitude suggests that 
self-ratings of prejudice and ratings of the 
personality characteristics from Negro photo- 
graphs are operations having quite different 
psychological significance. 
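
As a rough check on the magnitude of this correlation, the usual conversion of a product-moment r to a t statistic can be sketched as follows. The formula is the standard one, t = r * sqrt(df / (1 - r^2)); the function names are hypothetical, and the paper does not state whether its p value was one-tailed or two-tailed.

```python
import math


def t_from_r(r, df):
    """Convert a product-moment correlation to a t statistic with df degrees of freedom."""
    return r * math.sqrt(df / (1.0 - r * r))


def shared_variance(r):
    """Proportion of variance the two measures have in common (r squared)."""
    return r * r


t_value = t_from_r(0.31, 56)      # about 2.44
common = shared_variance(0.31)    # about 0.096
```

On these figures the two measures share roughly a tenth of their variance, which is in keeping with the remark that the relationship, though reliable, is low in magnitude.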

Finally, the less obvious hypothesis was
tested that both anti-Negro and pro-Negro
judges would perceive the Negro as more
Negroid in physiognomic traits than would
neutral judges, by a process of accentuation.
From Fig. 3, it is clear that this hypothesis is
upheld for the anti-Negro group, but not for
the pro-Negro judges. The location of the
mean Neg scores for the three groups of judges
agrees with the hypothesis; however, only the
difference between the anti-Negro and neutral
group is significant at the accepted level. A
further test of this hypothesis was made by
computing a correlation ratio between the
Likert prejudice score for each judge and his
Neg mean, but this was not significant (eta =
.14, df = 56, p > .05).

FIG. 3. MEAN INDICES OF PHYSIOGNOMIC NEGROIDNESS
FOR ANTI-NEGRO, NEUTRAL, AND PRO-NEGRO GROUPS
(Judges: Anti-Negro, Neutral, Pro-Negro)



DISCUSSION 


Although responding to a member of a 
minority group categorically (assigning a set
of traits to an individual because of his group
membership) has generally been a part of the
definition of a stereotype, empirical data have 
seldom been presented to support this notion 
in any direct fashion. Instead, it is usually 
implicitly assumed that if an individual ex- 
hibits prejudice toward Negroes on an attitude 
scale, he will express those feelings toward 
persons specified as Negroes, regardless of their 
characteristics. There is recognition, of course, 
that a Negro in a special face-to-face relation- 
ship to an individual may be differently per- 
ceived; e.g., one’s own Negro “Mammy,” but 
little attention has been paid to more general 
dimensions along which Negroes may differ 
and the perceptual variations they may pro- 
duce. Our data have been specifically concerned 
with the distinguishing physical characteristics
of the Negro, and we have shown that in
perceiving photographs which vary widely in
these characteristics, judges with varying 
degrees of prejudice are alike in that they all 
respond categorically. This evidence con- 
stitutes empirical support for the classical 
concept of the stereotype as a categorical 
response. 

The low but significant relationship be- 
tween prejudice scores and degree of stereo- 
typing deserves further comment. Although 
both may be regarded as measures of prej- 
udice, the operations involved are apparently 
sufficiently different to give rather divergent 
scores. The following are suggested as being 
among those factors differentiating the two 
operations: (a) Endorsement of an anti-Negro
scale item identifies the rater as prejudiced,
whereas assignment of an unfavorable trait
to a photograph is not clearly identified with
prejudice; (b) In rating photographs, physiognomic
cues other than racial determine, to
some degree, the attribution of personality 
traits; (c) A prejudice scale involves a con- 
scious and deliberate effort on the part of the 
subject to indicate his attitude toward Negroes, 
whereas rating personality traits from faces 
appears to be an intuitive, effortless procedure. 

There is an obvious difference in content, of 
course, between rating faces on such traits as 
intelligent look, superstitious, etc., on the one 
hand, and stating whether or not Negroes 
should be segregated in schools, buses, 
churches, hospitals, recreational facilities, and 
places of employment, on the other. Some at- 
titude items, however, concern traits or quali- 
ties of the Negro and, therefore, are not ap- 
preciably different in content from trait ratings 
of faces, although as indicated above, the 
frame of reference within which ratings are 
made may be quite different for the two opera- 
tions. The use of ratings of Negro photographs 
as a means of assessing the degree of prejudice 
perhaps deserves further attention as an al- 
ternative to the more familiar attitudinal 
techniques of measurement. 

Laboratory experiments on accentuation 
have concerned themselves primarily, al- 
though not exclusively, with size. There is no 
reason why more complex social variables 
might not be studied in the same fashion. In 
fact Bruner (4) has suggested that preference 
should be given to variables involving cues
that are relevant to personality factors. 

Accentuation may be regarded in general
terms as a process of perceiving valued objects
in a manner which enhances their value. Value
is enhanced through distortion of those dimen- 
sions of the object which are value-relevant. 
Thus, in the case of perceiving coins as slightly 
larger, size is thought to be value-relevant 
because larger coins, generally speaking, are 
worth more. In the case of symbols, such as 
the swastika, which has been used by Bruner 
and Postman (6) and Klein, Schlesinger, and 
Meister (12), size is not directly value-relevant;
there is no inherent reason why a larger
swastika should have more or less value than 
a smaller one. As Bruner (4) has noted, how- 
ever, increment in value might be derived 
from the fact that, in our culture, higher value 
is generally attached to larger things. 

Taken as a whole, however, the empirical 
findings on size-accentuation are by no means 
entirely consistent or clear (cf. Allport, 2). In 
particular, the rationale for the relevance of 
size to value is of doubtful validity. The dimen- 
sions of physiognomic Negroidness in the 
present study, on the other hand, are clearly 
relevant, because the Negro is defined in 
physiognomic terms, and our value-measure is 
attitude toward Negroes. Negroidness is a 
negative value, and as such is accentuated or 
made more pronounced by our prejudiced 
judges. 

Some comment on the performance of the
pro-Negro group is in order. In a number of
previous experiments on size-accentuation
(5, 6, 8, 12), both positively and negatively
valued objects have sometimes been accentuated.
From these studies it might be predicted
that the pro-Negro group would likewise
accentuate Negroid features. Our interpretation
of this empirical prediction in terms of the
dynamics involved is that, although pro-Negro
subjects may have adopted “tolerance for
minority groups” as an important value, this
value has not been fully integrated into their
basic personality structure. These subjects
might thus be expected to “lean over backwards”
to avoid the appearance of prejudice,
and hence are oversensitive to the identifying
characteristics of a minority group.

Although the pro-Negro group rated the
photographs more Negroid than did the neutral
group, this difference was not statistically
significant. In this regard, a difficulty in the
present design is the arbitrariness of identi- 
fying the “neutral” point of the prejudice 
scale. Conceivably, some of the pro-Negro 
judges were “really neutral,” so that the 
difference between the neutral and the pro- 
Negro groups as chosen was attenuated. The 
small size and the somewhat atypical nature of 
our pro-Negro sample prevent us from drawing 
certain conclusions. 


SUMMARY AND CONCLUSIONS 


A series of 15 photographs, 10 of Negro 
faces, and 5 of White faces, were rated by 58 
judges on two 7-point scales. One of these 
scales consisted of 10 physiognomic traits 
known to be characteristic of the Negro race, 
and the other, of 15 personality characteristics 
generally accepted as comprising the Negro 
stereotype. Ratings on the Negro and the 
White photographs were next compared for 
each trait. Those traits which failed to dis- 
criminate between the two sets of photographs 
were eliminated. For every photograph, two 
basic scores were obtained from each scale by 
adding together the scores for all judges on the 
remaining traits. Thus, a score for physiognomic 
Negroidness and for personality stereotyping 
was obtained. Negro photographs had been 
previously selected to insure a considerable 
range of variation in Negroidness. Prejudice 
scores were also obtained by administering a
Likert-type scale of attitude toward the
Negro.

The hypotheses tested and the findings
obtained are as follows:

The first hypothesis was that there is no
difference in the degree of personality stereo-
typing of Negro photographs varying widely
in physiognomic Negroidness. That is, even if
he has a Caucasian-like appearance, a Negro
will be seen as having, in full degree, all the
stereotyped traits usually attributed to the
Negro. Supporting evidence was obtained:
there was no decrease in stereotyping in moving
from the most Negroid Negroes to the most
Caucasian. The generally accepted but seldom
tested definition of a stereotype as a categorical
response to a member of a minority group is
thus upheld.

The second hypothesis was that anti-Negro
judges exaggerate the personality stereotype
of Negroes, whereas pro-Negro judges
de-emphasize it. Comparison of the mean
stereotype ratings of judges put into groups on the
basis of their prejudice scores (anti-Negro, 
neutral, and pro-Negro) lent significant sup- 
port to the hypothesis. Apparently, indicating 
degrees of agreement with pro-Negro or anti- 
Negro statements, as on the Likert scale, has 
something in common with rating Negro faces 
on valued traits—both may be regarded as 
ratings of prejudice. 

Finally, the hypothesis of perceptual ac- 
centuation was tested for physiognomic traits: 
both anti-Negro and pro-Negro judges were 
expected to perceive the Negro as more 
Negroid in physiognomic traits than do neutral
judges. Comparison of the mean Negroidness
scores for the three groups of judges supported
the hypothesis, although the difference between 
the neutral group and the pro-Negro group was 
not statistically significant. This partial lack 
of support for the hypothesis might be due to 
the atypical nature of the pro-Negro sample, 
which was drawn from a different population 
than the other two groups, or to a difficulty
in defining what is “neutral” and what is
“pro-Negro” on a Likert-type scale. 
THE RELATIONSHIP BETWEEN HUMOR AND THE EXPRESSION
OF HOSTILITY¹


DONN BYRNE 


Stanford University, Veterans Administration Hospital, Palo Alto 


PSYCHOLOGICAL literature is marked with
reports of sporadic attempts to harness
humor as a psychodiagnostic tool. Before
this goal can be attained, it is necessary to
discover and understand the part, if any, that 
humor plays in personality dynamics. As a 
step in this direction, the present study was 
undertaken in order to relate both humor 
preferences and the understanding of humor 
to other behavioral characteristics. 

The bulk of the experimental work dealing 
with humor has included the assumption, 
either explicitly or implicitly, that some sort 
of tension-releasing or drive-reducing process 
is involved in responding positively to humor- 
ous stimuli. This view of humor is consonant 
with psychoanalytic theory. Freud (6) postulated
that the pleasure in what he calls
tendency-wit results from the gratification of
a tendency which would otherwise remain
unfulfilled. Gratification is made possible 
because the censor is deceived temporarily by 
the humorous camouflage. Authors of later 
psychoanalytic writing similarly equate 
laughter and milder amusement with the 
release of repressed impulses and the conse- 
quent reduction of tension (2, 12, 14). 
Proceeding from the idea that humorous 
material can serve to release repressed im- 
pulses or reduce drives, several attempts have 
been made to utilize humor as an indicator of 
these hidden forces. In an informal and ex- 
ploratory study, Redlich, Levine, and Soller 
(11) used cartoons as a device for obtaining 
clues to basic needs and conflicts among 
neuropsychiatric patients and normals. Their
observations led them to believe that a posi- 
tive response to humor indicated the release 
of suppressed or repressed needs but that 
responses of indifference or dislike did not 
necessarily imply an absence of such needs. 
Murray (10), using paper-and-pencil ques- 
tionnaires, assessed the aggressive sentiments 
and aggressive conduct of university students. 
He found that the former was highly related to 
the appreciation of disparaging jokes while 
the latter was not. From these results and the 
study of autobiographical material, he con- 
cluded that laughter at derisive jokes was the 
consequence of repressed hate. 

A somewhat more inclusive approach is 
represented by Cattell, Luborsky, and 
Saunders (1, 3, 4, 9), who employed a factor- 
analytic technique. Their original intention 
was that of designing a humor test which 
would reveal an individual’s major repressions 
by those jokes which he found amusing. To 
their surprise, a positive relation was con- 
sistently found between the vector measured 
by the joke cluster and the overt personality 
trait as defined by the Guilford-Martin ques- 
tionnaires. Cattell resolved this apparent 
conflict between theory and findings by as- 
suming that the deviation of overt and covert 
personality from the approved social norm is 
proportional. For example, the individual who 
frequently behaves in an overtly hostile 
manner also has more hostility hidden beneath 
the surface than the one who fails to manifest 
his hostility. 

It appears that various investigators, ap- 
proaching the study of humor with hypotheses 
based on Freudian theory, have found contra- 
dictory or inconclusive results, but have been 
loath to reject the theory. 

The present study was designed to investi- 
gate the relationships among (a) behavior 
ratings of expression of hostility, (b) the extent
to which hostile cartoons are judged to be 
amusing, and (c) the ability to recognize that 
the cartoons contain hostility. Freudian theory 
would seem to predict a negative correlation 
between expression of hostility and finding
hostile cartoons amusing while previous re- 
search findings would tend to suggest a positive 
one. The purpose of this study was to help 
resolve this conflict. 


METHOD 


In order to secure a sizable sample of cartoons to 
serve as humorous stimuli, over 230 cartoons were re- 
moved from several back issues of the Saturday Evening 
Post and Collier’s. These cartoons were presented to 
five psychologists and graduate psychology interns to 
be judged as expressing or not expressing interpersonal 
hostility. All five judges agreed on 80 of the cartoons, 
with 40 labeled hostile and 40 nonhostile. From this 
pool, 32 were selected in such a way as to form a hostile 
set of 16 matched as closely as possible in content with 
a nonhostile set of 16. The hostile and nonhostile
cartoons were combined in random order to form the
complete series.

The Ss consisted of male neuropsychiatric patients
at the Veterans Administration Hospital, Palo Alto,
California. On two separate admission wards, the
psychiatrist, head nurse, and chief attendant
independently rated a total of 89 patients on their typical
manner of expressing hostility. All of the patients rated
had been on the ward a minimum of six weeks and
had been in close contact with the raters during this
period. Each patient was placed in one of three groups
according to definitions adapted from a rating scale
developed by Schultz (12). These definitions were as
follows:

“A patient classified as overtly hostile is one who
‘acts out’ aggressively in response to tension producing
and frustrating situations. This aggression may be
expressed either physically or verbally. The overtly
hostile patient usually sees himself as being in the
position of having to defend himself by hostile and
aggressive actions. He will often interpret members of 
the hospital staff as being very threatening and will 
react by physically or verbally attacking staff members 
directly or by running down the hospital policies, 
regulations, and activities in general. 

“A patient classified as covertly hostile is one who 
expresses his hostility in a subtle, indirect, insidious, 
or passive manner. That is, regardless of whether the 
person is aware or unaware of the hostile implications of 
his behavior, he is in a position of not having to accept 
the responsibility for it. Some activities which might 
be seen as examples are: dumping cigarette ashes on the 
floor with an ash tray near by, frequently being late for 
ward activities and appointments, interrupting a person 
who is talking and then apologizing, or frequently being 
involved in accidents involving the belongings of others. 

“A patient classified as nonhostile is one who does not 
fall in either the overtly hostile or covertly hostile 
categories. As far as can be observed, he does not act 
out his aggressive impulses in either overt or covert 
behavior. An extremely nonhostile patient might be 
described as the ‘Caspar Milquetoast’ type. Every 
effort is made to comply with all the demands and 
wishes of those around him, especially with the rules 
and regulations of the hospital.” 

An S was assigned to one of the three hostility 
groups if two of the three raters agreed on the grouping.
Out of 89 patients rated, only six had to be eliminated 
because of failure to meet this criterion. From the re- 
maining 83 patients, 45 Ss were randomly selected, with 
15 in each of the three groups. 

Table 1 indicates the characteristics of the 45 
experimental Ss. Because the Ss of the nonhostile 
group differed markedly from those of the other two 
groups with respect to age, product-moment correla- 
tions between age and the two experimental measures 
were computed. Neither relationship was statistically 


TABLE 1 


AGE, LENGTH OF HOSPITALIZATION, AND DIAGNOSIS OF THE EXPERIMENTAL SUBJECTS 


Experimental Group 





Characteristic 


Range 
Median 
First and third quartiles 
Length of Hospitalization (months) 
Range 
Median 
First and third quartiles 
Diagnosis 
Schizophrenic disorder 
Psychoneurotic disorder 
Organic brain damage 
Psychosis, unclassified 
Character and behavior disorders 
Paranoid disorder 
Affective disorder 
No diagnosis available 


30 and 46 


Covertly 
hostile 


Overtly 


hostile Nonhostile 


‘ 22-55 31-72 
34 3 41 
29 and 38 37 and 56 


3-79 4-54 2-386 
7 14 11 
6 and 20 10 and 27 5 and 61 7 and 26 


11 
significant (Age with Hostile Humor Score, r = -.05,
p > .05; Age with Hostility Judging Score, r = -.25,
p > .05). In length of hospitalization there was a
tendency for those who had been hospitalized the
shortest time to express their hostility more overtly,
but this variable was not significantly related to either
experimental measure (Months of Hospitalization with
Hostile Humor Score, r = .04, p > .05; Months of
Hospitalization with Hostility Judging Score, r = -.25,
p > .05). The best represented diagnostic category
was that of schizophrenic reaction, with over half of
these Ss subclassified as paranoid type. There was a
preponderance of this category in the covertly hostile
group, but the mean differences on the experimental
scores between the schizophrenic Ss and those in the
other diagnostic groups combined were not significant
(Hostile Humor Scores, t = .40, p > .05; Hostility
Judging Scores, t = .51, p > .05).

Each S, tested individually, was first asked to place
the 32 cartoons in four equal piles according to how
funny they were to him. The piles were labeled as
“most funny,” “next,” “next,” and “least funny.” For
the second task, each S was given the same cartoons
in the original order and asked to divide them into two
equal piles. In one pile were placed those cartoons in
which “insults, anger, teasing, or meanness” was
expressed, and in the other pile those in which these
factors were not present. In effect, he was informed 
that half the cartoons contained hostile material; his 
task was to divide the 32 cartoons appropriately. The 
entire testing procedure lasted from 20 to 40 minutes. 
The reliability of the hostility-judging task performed 
by the Ss was .86 as computed by the split-half method 
corrected by the Spearman-Brown formula 
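
The Spearman-Brown correction mentioned here can be sketched as follows. The formula itself is standard; the function names are hypothetical, and the uncorrected half-test correlation is not reported in the text, so the value computed below is only the one implied by the corrected figure of .86.

```python
def spearman_brown(r_half, k=2.0):
    """Predicted reliability of a test lengthened k times, given reliability r_half."""
    return k * r_half / (1.0 + (k - 1.0) * r_half)


def implied_half_correlation(r_full):
    """Invert the k = 2 case: the half-test correlation implied by a corrected value."""
    return r_full / (2.0 - r_full)


r_half = implied_half_correlation(0.86)  # about 0.754
r_full = spearman_brown(r_half)          # recovers 0.86
```

A corrected coefficient of .86 would therefore correspond to a correlation of about .75 between the two halves of the 32-cartoon judging task, however those halves were formed.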


RESULTS 

The three groups were compared on their
ratings of the funniness of the hostile cartoons.
For every S, three points were assigned for
each hostile cartoon rated most funny, two
points for each rated next most funny, etc.
The total number of these points for each S
constitutes his Hostile Humor Score. The
means and standard deviations of the resulting
distributions are given in Table 2. A simple
one-way analysis of variance of the group
means failed to show significant differences
(F = 2.27, p > .05). However, when the Ss
were divided into two groups, those who
express hostility either overtly or covertly and
those who do not express hostility, a significant
difference between groups was found (t = 2.11,
p < .05). Those Ss who express hostility find
hostile cartoons funnier than do those Ss who
do not.
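
The Hostile Humor Score can be illustrated with a minimal sketch. The text gives three points for the "most funny" pile and two for the next; the values of one and zero for the remaining two piles are an assumption read into "etc.", and the function name, cartoon numbering, and example sortings are invented.

```python
POINTS_BY_PILE = (3, 2, 1, 0)  # most funny ... least funny; 1 and 0 are assumed
PILE_SIZE = 8                  # 32 cartoons sorted into four equal piles


def hostile_humor_score(pile_of, hostile_ids):
    """pile_of maps cartoon id -> pile index (0 = most funny, 3 = least funny)."""
    counts = [0, 0, 0, 0]
    for pile in pile_of.values():
        counts[pile] += 1
    if counts != [PILE_SIZE] * 4:
        raise ValueError("expected four piles of eight cartoons each")
    return sum(POINTS_BY_PILE[pile_of[c]] for c in hostile_ids)


hostile = range(16)  # hypothetical numbering: cartoons 0-15 are the hostile set

# A subject who puts every hostile cartoon in the two funniest piles.
favours_hostile = {c: c // 8 for c in range(32)}
# A subject whose sorting is unrelated to hostile content.
indifferent = {c: c % 4 for c in range(32)}

high = hostile_humor_score(favours_hostile, hostile)  # 8*3 + 8*2 = 40
even = hostile_humor_score(indifferent, hostile)      # 4*(3+2+1+0) = 24
```

Under this assumed point scheme scores can range from 8 to 40, and a sorting unrelated to hostile content gives 24, which is close to the group means reported in Table 2.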

Another complication arose in analyzing 
the second task. It was found that the number 
of cartoons correctly judged to express or not 
to express hostility (designated as the Hostility 
Judging Score) correlated significantly with
IQ as estimated by four subscales of the 
WAIS (r = .75, p < .01). These intelligence 
scores were available for only 10 of the Ss 
in each group. It should be noted that the 
correlation between the Hostile Humor Score 
and IQ was not significant (r = —.10, p > .05). 

Table 3 reports the means and standard deviations of
the Hostility Judging Scores of the three groups. An
analysis of variance revealed no significant group
differences in the mean number of cartoons judged
correctly (F = 1.28, p > .05). With intelligence
controlled by the covariance method, as shown in
Table 4, the three groups still did not differ
significantly on the task (F = 2.91, p > .05). Again,
however, when the Ss are divided into expressers and
nonexpressers of hostility, significant differences
occur (F = 5.82, p < .05) when IQ is controlled by
covariance. Those Ss who express hostility are better
able to recognize hostility in cartoons than are the
Ss who do not express it.


TABLE 2
COMPARISON OF GROUP DIFFERENCES IN THE HUMOR
RATINGS OF HOSTILE CARTOONS

                                     Humor Ratings of the
                                       Hostile Cartoons
Group                                N       M       SD
Overtly hostile                     15     25.33    2.33
Covertly hostile                    15     24.93    3.57
Nonhostile                          15     23.00    3.27
Overt and covert groups combined    30     25.13    3.02


TABLE 3
COMPARISON OF GROUP DIFFERENCES IN JUDGING THE
HOSTILE CONTENT OF CARTOONS

                                     Correct Judgments of
                                       Cartoon Content
Group                                N       M       SD
Overtly hostile                     15     22.93    5.31
Covertly hostile                    15     21.47    4.92
Nonhostile                          15     19.47    6.83


TABLE 4
COMPARISON OF GROUP DIFFERENCES IN JUDGING THE
HOSTILE CONTENT OF CARTOONS WITH
INTELLIGENCE CONTROLLED
BY COVARIANCE

                                     Correct Judgments     Estimated
                                     of Cartoon Content        IQ
Group                                N      M      SD       M       SD
Overtly hostile                     10    23.80    83     104.30   16.08
Covertly hostile                    10    21.20   4.75     92.40   10.47
Nonhostile                          10    18.20   5.69     95.50   16.33
Overt and covert groups combined    20    22.50    47      98.35   14.82

The only other finding of note was that the Hostile
Humor Scores and the Hostility Judging Scores were
not significantly related (r = .09, p > .05) unless the
effects of IQ were partialled out (r12.3 = .37,
p < .05). In view of the previous findings, this result
is not surprising.
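How partialling out a third variable can raise a near-zero correlation is easiest to see in the standard first-order partial correlation formula. The sketch below uses hypothetical input values chosen only to show the direction of the effect; they are not the study's data, and the function name is an assumption.

```python
import math


def partial_r(r12: float, r13: float, r23: float) -> float:
    """First-order partial correlation of variables 1 and 2, holding 3 constant."""
    return (r12 - r13 * r23) / math.sqrt((1.0 - r13 ** 2) * (1.0 - r23 ** 2))


# Hypothetical values: variable 3 relates negatively to variable 1 and
# strongly positively to variable 2, which masks the 1-2 relationship.
r12, r13, r23 = 0.10, -0.20, 0.70
print(round(partial_r(r12, r13, r23), 2))
```

When the third variable correlates with the two measures in opposite directions, the product r13 * r23 is negative, so removing it increases the numerator and the partial correlation exceeds the zero-order one.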


DISCUSSION 


The findings reported here appear to cor- 
roborate previous experimental results and 
to contradict psychoanalytic theory in that a 
positive relationship was obtained between 
the type of humor found amusing and overt 
personality characteristics. Cattell’s suggested 
resolution would explain the present findings 
by hypothesizing that those Ss who express 
more hostility also had more underlying, un- 
expressed hostility, and it was the latter factor 
which determined the response to the cartoons. 
Other possibilities exist, however, and it seems 
reasonable to explain the findings without 
depending on postulated repressed needs. 

Another thesis to account for the rather con- 
sistent experimental results can be stated in 
Hullian terms. A positive response to a par- 
ticular humorous theme may be conceived as 
a function of both drive and habit strength and 
not, as other investigators have hypothesized, 
a function of the strength of repressed drives 
only. From this viewpoint, the appreciation of 
hostile cartoons is seen simply as an expression 
of hostility by individuals who also express 
hostility in other situations. From Hull’s views (7),
it follows that individuals with strong aggressive
drives who experience drive reduction by expressing
hostility in a given situation have increased habit
strength for the expression of hostility in that
situation. Since Feshbach (5) found that aggressive
drive is partially reduced by its expression in fantasy
material, it seems likely that the potential
drive-reducing situations include fantasy and humor.
An additional postulate (7) is that habits are evoked,
on the basis of similarity, by stimuli other than the
original ones. Therefore, the situations in which
hostility is expressed could be arranged on a gradient of
generalization. As an example to illustrate 
both postulates, it would be predicted that 
individuals who were allowed to express
hostility toward their fathers without punish- 
ment would express hostility toward other 
authority figures, would enjoy stories in which 
such hostility was expressed, and would be 
amused by humorous material with this same 
theme. 

Another question, and one not answered 
by the present study, is whether Ss who express 
hostility necessarily possess the highest ag- 
gressive drives. One possible answer comes 
from Holzberg, Bursten, and Santiccioli (8), 
who found that Ss who express either more or 
less than average aggression in fantasy have 
strong aggressive tension. Perhaps the nonhostile Ss
in the present study had as strong aggressive drives
as the overtly and covertly hostile ones. Systematic
behavior theory would account for the difference in
present behavior on the basis of punishment following
the early aggressive behavior of the nonhostiles.

While the instrument reported in this study 
yielded significant group differences, its value 
as a predictor of individual behavior is low. 
However, these various findings and tentative 
generalizations suggest that it may be possible 
to construct longer and greatly refined humor 
tests in which the degree of positive response 
to hostile humor would measure habit strength 
for the expression of hostility. A second sug- 
gestion is that the degree of either positive or 
negative response to the humor would be a 
measure of drive strength. Further, one might 
logically expect these same relationships to 
hold for other drives besides aggressive or 
hostile ones, so other appropriate humor tests 
could be devised. One last possibility is that 
humor tests could be made very specific as 
indicators of particular habits in particular 
situations. 

The finding that those Ss who express 
hostility are better able to recognize it in the 
cartoons remains to be explained. If both 
hostile and nonhostile Ss have strong aggres- 
sive drives, it could be postulated that the 
former are more sensitive to stimuli which 
offer a reduction in their drive state, whereas
the latter are less motivated to recognize aggressive
stimuli because response to such
stimuli has not been rewarded in the past. If 
the nonhostile Ss are actually avoiding aggres- 
sive cues (instead of merely being less sensi- 
tive), this phenomenon amounts to what other 
investigators have designated as perceptual 
defense against threatening stimuli. 

The positive correlation between intelligence 
and the ability to identify correctly the hostile 
and nonhostile cartoons points to the possi- 
bility of utilizing humor to a greater extent 
than is presently done in the measurement of 
intelligence. The task involved in this research 
required the understanding of either the point 
of each cartoon or the interpersonal situation 
depicted. In other words, the Hostility Judging 
Scores seem to represent, in part, an index of 
the ability to carry on abstract thinking. The 
use of cartoons as an intelligence test subscale 
may prove useful if they provide a reliable and 
economical measure. It should be noted, 
though, that with the cartoons used in this 
study, a ceiling IQ of about 120 was found. 
Consequently, humor as an intelligence meas- 
ure may only be useful in rough screening 
devices. Since both intelligence and personality 
dynamics were found to be related to the task 
of identifying hostile cartoons, any successful 
attempt to construct either a diagnostic device 
or an intelligence scale with humor as a base 
must avoid the confounding of these two 
factors. 

A word of caution about the experimental 
results seems in order. It is undoubtedly de- 
sirable to present any experimental findings 
simultaneously with cross-validation material. 
At the very least, there should be some as- 
surance from the investigator that the meas- 
ures which he is using show some stability over 
time. In practical situations the fulfillment of 
these ideals is not always immediately possible; 
the present study is an example. Nevertheless, 
the experimental results were statistically 
significant, tend to be compatible with previous 
findings, and may serve to stimulate further 
work in the area. 


SUMMARY AND CONCLUSIONS 


This study was undertaken in order to help 
resolve the conflict between Freudian theory 
and previous experimental findings by exploring
the relationships among (a) the expression
of hostility in behavior, (b) the extent to
which hostile cartoons are judged funny, and 
(c) the ability to recognize hostility in cartoons. 

Three groups of male neuropsychiatric pa- 
tients were rated as expressing hostility overtly, 
covertly, or not at all. They were asked to 
judge the funniness of 32 cartoons and also to 
indicate which of the cartoons expressed 
hostility and which did not. It should be 
emphasized that the generality of these findings
cannot be known until the study is
cross-validated with other Ss.

It was found that: 

1. Those Ss who frequently express hostility, 
either overtly or covertly, find hostile cartoons 
significantly more amusing than do those Ss 
who fail to express hostility. 

2. There is a significant positive correlation 
between ability to differentiate hostile and non- 
hostile cartoons and estimated IQ, but there is 
no significant correlation between IQ and find- 
ing the hostile cartoons amusing. 

3. Those Ss who frequently express hostility, 
either overtly or covertly, are significantly 
better able to differentiate hostile and non- 
hostile cartoons than are those Ss who fail to 
express hostility, if the effects of intelligence 
are statistically controlled. 

4. There is a significant positive correlation 
between ability to differentiate hostile and 
nonhostile cartoons and finding hostile cartoons 
amusing only if the effects of intelligence are 
partialled out.


DISTRIBUTIONS OF TRAITS IN CURRENT Q-SORT METHODOLOGY¹
AUSTIN JONES²


Central State Hospital, Indianapolis 


A NUMBER of investigations within the
last few years have used the method 
referred to as Q sort for research in 
the areas of personality and psychotherapy. 
The method has been drawn from the group of 
related inverted factor-analytic procedures 
known as Q technique, dealt with most ex- 
tensively by Stephenson (11). In the initial 
study by Hartley (5), the Q sort was used to 
study changes in the client’s “self concept” and
“ideal self” and the relationship between them 
in the course of therapy. Butler and Haigh (1), 
in a recent study, investigated the same rela- 
tionship. Rudikoff (10) studied the relation- 
ships between concepts of self, “ordinary 
person,” and the “ideal” at several points in
time relative to the therapeutic process. In 
addition to these researches, there have been a 
number of others using Q sort in the investi- 
gation of closely related topics (3, 4, 6, 8, 9, 12). 
Common to all of the studies cited has been 
the characteristic Q-sort procedure which re- 
quires the S to sort a number of statements of 
self-reference into a series of piles along a 
continuum of appropriateness or accuracy of 
self-description, from those that are “least 
like” him to those that are “most like” him.
The number of items sorted into each pile is 
specified in such a way that the frequency 
distribution of items in the piles along the 
continuum approximates the normal distribu- 
tion. The items may be selected in a number of 
ways but in each case are presumed to sample 
randomly a previously defined “trait universe.”

¹ The author wishes to express his appreciation to
Dr. Morton Wiener, Chief Psychologist of Central
State Hospital, for his advice and encouragement in the
execution of this study and to Dr. C. L. Williams,
Superintendent of Central State Hospital, and Dr.
Ruth B. Hines, Chief Psychologist of Veterans
Administration Hospital, Indianapolis, for permission to
utilize patients as research subjects. The author is
indebted to James Norton of the Statistical Laboratory,
Purdue University, for statistical help and advice.
² Now at The University of Rochester.

The practice of forcing a distribution (in this
case, a quasi-normal distribution) was adopted
for several reasons. Perhaps the more important
ones are that such a procedure acts to
reduce response sets and that variance in the
responses is automatically obtained. It is also
a procedure of some statistical convenience, as
the means and standard deviations of all Ss’
distributions become automatically the same
and thus greatly simplify computation of the
product-moment correlations involved in each
case. Some limitations inherent in this procedure
have already been noted. Cronbach, for
instance, has pointed out the loss of information
that may occur when individual differences
in shape of distribution are suppressed.
Forced-distribution procedures, he comments,
“. . . may be psychologically indefensible, if
there is reason to think that persons differ in
their variability over traits” (2, p. 379).
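The statistical convenience of a forced distribution can be made concrete. When two sorts share the same mean and variance, the product-moment correlation reduces to r = 1 - Σd² / (2Nσ²), where d is the difference in pile score for each item. The sketch below checks this shortcut against the full formula; the five-pile distribution is hypothetical and much smaller than any used with a real Q sort, and all function names are assumptions.

```python
import math
import random

# Hypothetical forced distribution over five piles (scores 1..5).
FORCED_FREQS = [1, 2, 4, 2, 1]


def scores_from_freqs(freqs):
    """Expand pile frequencies into one pile score per item."""
    return [pile for pile, f in enumerate(freqs, start=1) for _ in range(f)]


def pearson_r(x, y):
    """Full product-moment correlation."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def forced_sort_r(x, y):
    """Shortcut valid only when both sorts use the same forced distribution."""
    n = len(x)
    mean = sum(x) / n
    var = sum((a - mean) ** 2 for a in x) / n
    d2 = sum((a - b) ** 2 for a, b in zip(x, y))
    return 1.0 - d2 / (2.0 * n * var)


# Two simulated sorts: the same pile scores assigned to items in different orders.
base = scores_from_freqs(FORCED_FREQS)
sort_a, sort_b = base[:], base[:]
random.Random(0).shuffle(sort_a)
random.Random(1).shuffle(sort_b)
print(round(pearson_r(sort_a, sort_b), 4), round(forced_sort_r(sort_a, sort_b), 4))
```

The shortcut breaks down as soon as Ss are free to choose their own distributions, since the means and variances then differ from sort to sort.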

Since Cronbach’s note of caution there has 
been little if any change in the Q-sort meth- 
odologies in published research. The studies 
cited earlier, with the exception of Hartley’s, 
have all appeared since Cronbach’s article, 
thus leading to the impression that the authors 
have taken the position that there are no sig- 
nificant differences in the shape of distributions 
of traits among individuals. 

The characteristic practice of forcing a quasi- 
normal distribution would seem to imply 
further that the characteristic shape of dis- 
tribution of traits within individuals is also a 
quasi-normal distribution. These statements do 
not apply only to the set of trait statements 
but also, since they are presumed to be random 
samples from the trait universe, to the dis- 
tribution of the trait universe as well. 

The formulation that has been inferred from 
current Q-sort procedures may be stated more 
specifically as follows: The distribution of the 
degree of development of the universe of traits 
within individuals is a quasi-normal distribu- 
tion; although relative development of differ- 
ent traits may vary from time to time, the 
degree of development of traits within individuals
remains always a quasi-normal distribution.
In other words, the strength or predominance
of a given trait may wax and wane in the
individual’s life, but the distribution of
strength of traits (operationally, the degree of
their self-descriptiveness) remains constantly
quasi-normal.










The purpose of the present study is to sub-
ject this implicit theoretical formulation to 
experimental verification and to assess the 
appropriateness of the corresponding statistical 
procedure of the forced quasi-normal distribu- 
tion. (It should be noted that there are at least 
two antecedent assumptions of Q-sort pro- 
cedures, each of which may merit study. One is 
the assumption that the Q-sort items randomly 
sample the trait universe. The second is the 
assumption that the Ss are able to report 
meaningfully or accurately their perception of 
the degrees of self-reference of the Q-sort 
items. For the purposes of this study, these
assumptions are provisionally accepted.)

Within the framework of the theoretical 
position stated above, it may be hypothesized 
that individuals, given statements which are 
presumed to sample the total population of 
traits and allowed to indicate what they feel to 
be the relative appropriateness of self-descrip- 
tion of the statements, will assign values in 
such a way that their frequencies will closely 
approximate the normal distribution. It is 
with the testing of this hypothesis that the 
present paper is concerned. 

A “free-sort” procedure, i.e., one in which
the distribution is not specified, would be ex- 
pected to give evidence of, among other things, 
the range of meanings with which the indi- 
vidual is able to interpret himself in relation to 
the world or by which he perceives himself. 
This is inherent in the task of rating the ap- 
propriateness of self-description of items. There 
seems to be no readily apparent reason why 
individual differences would not manifest 
themselves along this as well as other personality
variables. Thus, the first of the two
predictions of this study is: Ss will show signifi- 
cant variation among themselves in regard to 
shape of distribution sorted, and, concom- 
itantly, Ss will show significant variation in 
shape of curve from the quasi-normal curve 
predicted by the theoretical position described 
earlier. 

Maladjusted individuals have been described 
from time to time as suffering from distortions 
and disturbances in the variety of meanings 
with which they can interpret themselves and 
situations about them. A recent statement of 
this view has been made by Osgood and Luria 
(7). It seems consistent with such theory to 
hypothesize that maladjusted individuals in a
“free-sort” procedure would manifest a tendency
to deny appropriateness of self-description
of items and to dichotomize items in such a 
way that only a few degrees of appropriateness 
of self-description are admitted. The second 
prediction of this study, then, may be stated 
formally as: Maladjusted Ss will show greater 
deviation in shape of distribution from the 
normal distribution than will less maladjusted
or “normal” Ss, with greater degrees of mal- 
adjustment showing greater degrees of devi- 
ation. 


METHOD 


Subjects. Three groups of ten Ss each were chosen 
to represent three rather widely varying degrees of 
personality integration or adjustment. The first group 
was drawn on a volunteer basis from the male students 
taking a required introductory college course in 
psychology. The second group was drawn randomly 
from the population of male neurotic patients receiving 
treatment at a Veterans Administration hospital. 
Patients with organic involvement or mental deficiency, 
with prior history of psychosis, over 36 years of age, 
or receiving insulin or electroshock within the six weeks 
prior to testing were excluded. Anxiety neurosis was 
the primary classification. The third group was drawn 
from the population of male patients at a state mental 
hospital, each of whom would be commonly regarded as 
psychotic. These patients were selected as they were 
referred routinely for psychological evaluation. Pa- 
tients grossly out of contact, over 36 years of age, with 
organic involvement or mental deficiency, and those 
having received insulin or electroshock treatment 
within the previous six weeks were excluded as Ss. 
Schizophrenic disorders were predominant.

The groups were equated for age and education as 
closely as the populations permitted. The mean ages 
for the college, neurotic, and psychotic groups were 
24.0 years, 28.8 years, and 29.7 years respectively. The 
mean educational level for the three groups (highest 
grade completed) was 12.8, 11.3, and 12.2. 

Materials. The materials used were the 100 items of 
the SIO Q sort developed by Butler and Haigh (1). 
This particular set was selected because of its frequent 
and systematic use in recent research. The items are 
statements of self-reference typed on individual cards. 
Sample items: “I am a dominant person.” “I am liked
by most people who know me.” “It’s hard to control
my aggression.”

Procedure. All Ss participated on a volunteer basis, 
their right to decline having been clearly expressed. 
The Ss were informed that the results of their per- 
formance would have no bearing on course work in the 
case of the college Ss, or upon hospitalization or type 
of treatment in the case of the neurotic and psychotic 
groups. In the course of testing, one S from the neurotic 
population and two from the psychotic population 
declined to complete the task and were necessarily 
excluded as Ss. 

All Ss were seen individually by the author. The 
Ss were asked to sort the 100 items as to their degree
of appropriateness of self-description on a 9-point scale
of equal-appearing intervals. Blue cards numbered one
through nine were placed in a row on the table surface
to indicate the ascending order of points on the con- 
tinuum. The instructions were those used by Butler 
and Haigh and others with the SIO Q sort, plus addi- 
tional instructions for the equal-appearing intervals 
aspect of the task. A standardized set of gestures was 
used concurrently with the verbal instructions for the 
purpose of making the scaling procedure as clear as 
possible. The E presented the following instructions
orally. The portion italicized is the basic instruction 
used by Butler and Haigh. 

“Here is a set of cards which have on them state- 
ments about how people think and feel. Your task is to 
sort these cards to describe yourself as you see yourself 
today from those that are least like you to those that are 
most like you. Sort them into nine piles. In the first pile
put the statement, or statements, that describes you
least well, that is least like you. In the ninth pile put
the statement, or statements, that describes you
best, that is most like you. In the piles in between,
arrange the other statements so that each pile describes
you better and better, and so that the distances between
them are even. That is very important. Try to make
the distances between the piles as even as possible, so
that the statements in this pile [the second] are more
like you than in this one [the first], and those in the
next pile [the third] are by the same amount more like
you than in this one [the second], and those in this pile
[the fourth] are by the same amount more like you
than in this one [the third], and so on. So that the
increase from pile to pile is the same. Do you
understand?

“Now in each pile you must put at least one card. 
Beyond that you may put as many or as few as you 
wish in any pile—even none. 

“Are there any questions?” 

Where information regarding reading ability was 
incomplete, Ss were requested to read sample items 
aloud. Two Ss from the psychotic population were 
found to have only minimal comprehension of the items 
and were thus excluded from the study. The Ss were 
informed that they might ask questions as they went 
along. By far the majority of the questions concerned 
the meaning of individual words. In such cases E 
encouraged verbalization of S’s impression of the word. 
When no reaction was forthcoming or when the word 
was badly misunderstood, E supplied as standardized a 
meaning as possible. 

In order to ascertain whether or not the Ss were, in 
fact, using a 9-point scale of equal-appearing intervals, 
E conducted a brief postexperimental inquiry with each 
S after he had completed the sorting. The following 
question or a variation of it was asked: “What was it 
about the task that you found most difficult?” In only 
one case was the equal-appearing intervals aspect cited. 
With further inquiry, this S indicated that while he 
found it the most difficult part, he nevertheless felt 
that he had mastered it adequately. The other Ss were 
then queried specifically concerning the equal-appearing 
intervals aspect. All Ss reported their confidence on this 
point. When E still felt any doubt, S was further en- 
couraged to verbalize the scaling principle as a final 
check. In each case he responded adequately. 


RESULTS 


A comparison of the shape of each S’s 
distribution with that of the approximate 
normal distribution customarily imposed in use 
of the SIO Q sort was made by use of the chi- 
square test of goodness of fit. The number of 
items falling at each of the nine classes for each 
S’s sorting constituted the obtained frequen- 
cies. The numbers of items customarily re- 
quired in each class for the approximate normal 
distribution were the “expected” frequencies. 
Significant differences between obtained dis- 
tributions and the approximate normal dis- 
tribution were found for all Ss in each of the 
three groups; p in all cases was less than .01.
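The goodness-of-fit comparison described above can be sketched as follows. The "expected" frequencies here are an assumed quasi-normal distribution of 100 items over nine piles, since the text does not list the frequencies actually imposed with the SIO Q sort; the "observed" free sort is hypothetical, and the critical value is the tabled chi square for 8 degrees of freedom at the .01 level.

```python
# Assumed forced quasi-normal frequencies for 100 items in nine piles
# (illustrative; not taken from the text).
EXPECTED_FORCED = [1, 4, 11, 21, 26, 21, 11, 4, 1]

# Hypothetical U-shaped free sort by one S.
observed_free = [22, 10, 8, 6, 8, 6, 8, 10, 22]

CRITICAL_CHI2_DF8_P01 = 20.090  # tabled value, df = 9 - 1 = 8


def chi_square_gof(observed, expected):
    """Pearson chi-square statistic for goodness of fit."""
    if len(observed) != len(expected) or sum(observed) != sum(expected):
        raise ValueError("observed and expected must match in length and total")
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


chi2 = chi_square_gof(observed_free, EXPECTED_FORCED)
print(round(chi2, 1), chi2 > CRITICAL_CHI2_DF8_P01)
```

Because the assumed expected frequencies in the extreme piles are as small as 1, even a handful of extra items placed there inflates the statistic sharply, so a free sort with heavy use of the end piles departs from the forced distribution by a wide margin.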

The composite (or average) distributions for 
each of the three groups are shown with the 
approximate normal distribution in Fig. 1. 
Differences among the three curves were 
studied by the analysis of variance which is 
summarized in Table 1. The proportion of the 
100 Q-sort items sorted into each of the classes 
along the continuum of appropriateness of self- 
description constituted the scores for each S. 
The arc-sine transformation was applied to the 
scores. Bartlett’s test was computed on the 
transformed scores and the homogeneity-of- 
variance assumption supported. Analysis over 
the three curves failed to show significant 
differences. Separate F tests, however, com- 
paring each curve with each of the others 
showed significant dissimilarity between the
curves for the college and for the psychotic
groups (p < .05). The objection might be
raised that declaring such an outcome “significant”
is, to an extent, “capitalizing on chance”
because of the interdependence of the three F
tests. In this case, however, the rank relationship
obtained among the three curves is a
specifically predicted one: that maladjusted
Ss will show greater deviation in the shape of
their distribution from the approximate normal
distribution than will less maladjusted Ss, with
greater degrees of maladjustment (neurotic,
psychotic) showing greater degrees of deviation.
Thus, it would seem that the bias inherent
in ascribing significance to this comparison
is not as great as might otherwise be
the case.


TABLE 1
ANALYSIS OF VARIANCE OF PROPORTION OF ITEMS
PLACED AT EACH CLASS INTERVAL
(Data in the form of arc-sine transformation
for proportions)

Source                                       df       MS        F
Group × class                                16     7790.4    1.33
Pooled Ss × class                           216     5841.7
Group (excluding neurotic) × class            8    13596.7    2.32*
Pooled Ss (excluding neurotic) × class      144     5851.6
Group (excluding college) × class             8     7243.0    1.23
Pooled Ss (excluding college) × class       144     5892.2
Group (excluding psychotic) × class           8     2530.5
Pooled Ss (excluding psychotic) × class     144     5788

* Significant at the .05 level.
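The arc-sine transformation applied to the proportions before the analysis of variance can be sketched as below. The text does not say which variant was used (angle in degrees or radians, with or without a doubling factor), so the form shown, the angle whose sine is the square root of the proportion, is an assumption, as are the function name and the sample sort.

```python
import math


def arcsine_transform(p: float, degrees: bool = True) -> float:
    """Variance-stabilizing transform for a proportion: asin(sqrt(p))."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("a proportion must lie between 0 and 1")
    angle = math.asin(math.sqrt(p))
    return math.degrees(angle) if degrees else angle


# Hypothetical free sort: proportion of the 100 items placed in each pile.
pile_counts = [22, 10, 8, 6, 8, 6, 8, 10, 22]
scores = [arcsine_transform(c / 100) for c in pile_counts]
print([round(s, 1) for s in scores])
```

The transform stretches proportions near 0 and 1, where the sampling variance of a raw proportion shrinks, so that the scores entering the analysis have more nearly equal variances.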

The comparison of curves for Ss within
groups was made by contingency chi squares,
as indicated in Table 2. For each of the three
groups the chi square was based on a work
table of ten Ss’ sorting of items into nine class
intervals. Significant differences were found
among the item distributions of Ss comprising
each of the experimental groups, indicating
that none of the groups could be characterized
by a single distribution shape with
only minor subject-to-subject variation from
it. Further analysis, however, may yield two or
three composite curves for a given group which
together subsume a very high proportion of
the total variance. In general, two major
shapes are apparent, those skewed to the right
and those skewed to the left.


TABLE 2
CONTINGENCY CHI SQUARES COMPARING SUBJECTS’
CURVES WITHIN EACH EXPERIMENTAL GROUP

Group          df        χ²
College        72    1986880.0*
Neurotic       72      30751.8*
Psychotic      72     356118.0*

* Significant at .001 level.
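The within-group comparison rests on a work table of ten Ss by nine class intervals, which is where the 72 degrees of freedom in Table 2 come from: (10 - 1)(9 - 1) = 72. A minimal sketch of the contingency chi-square computation follows, using hypothetical sorts rather than the study's data; the function name is an assumption.

```python
def contingency_chi_square(table):
    """Pearson chi square and degrees of freedom for an r x c table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, obs in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:  # skip classes no S used
                chi2 += (obs - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df


# Hypothetical group: five Ss skew one way, five skew the other.
skew_low = [30, 20, 15, 10, 8, 7, 5, 3, 2]
skew_high = skew_low[::-1]
mixed_group = [skew_low] * 5 + [skew_high] * 5

chi2, df = contingency_chi_square(mixed_group)
print(df, round(chi2, 1))
```

If every S in a group produced the same distribution the statistic would be zero; it grows as the Ss' curves diverge, which is the sense in which a significant value indicates that no single shape characterizes the group.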


DISCUSSION 


The hypothesis drawn from current Q-sort 
procedures that there is a quasi-normal dis- 
tribution of the degree of development of traits 
within individuals is not supported by the 
findings of this study, as was predicted. The 
application of chi square to a comparison of 
obtained curves with the approximate normal 
distribution resulted, as noted earlier, in significant
differences for all Ss, regardless of group
membership. The distribution of the universe 
of traits within individuals, assuming that the 
SIO Q-sort items are a random sampling of 
that universe, is clearly indicated to be non- 
normal and as varying from individual to 
individual. There is no evidence for a single 
characteristic distribution of any kind. 

It may be worth noting that the lack of
congruence between the forced quasi-normal
distribution and the “free-sort” distributions
(which may be regarded as more accurate
estimates of the distribution of the trait universe)
indicates distortions in the absolute values of
product-moment correlations computed be- 
tween Q sorts. Thus far, this possibility would 
not seem a major problem because Q-sort re- 
searches have been typically concerned with 
the identification of direction of change in 
certain correlations, an aspect which is less 
affected by such a lack of congruence. 

The second prediction, that deviation from 
normality of distribution increases with in- 
creasing maladjustment, received partial verifi- 
cation by the results of this study. The three 
composite curves—college, neurotic, and psy- 
chotic—seem increasingly U-shaped in that 
order (Fig. 1). Although this order corresponds 
to the one predicted, only the difference between
the college and psychotic groups is
statistically significant. These findings suggest
that the most maladjusted (specifically, psychotic)
Ss perceive a more restricted, limited
range of intensity of references to the self. Such
individuals in general react to self-reference
material as either very “true” or very “untrue”
of them. Responses indicating an acceptance of 
variable feelings or behavior, such as state- 
ments sorted near the middle of the scale, are 
much less common. To put it in information 
theory terms, it appears that more maladjusted 
persons attempt to organize the world in con- 
cepts of low information and high redundancy 
or predictability. The maladjusted individual 
appears less able than others to tolerate con- 
cepts of high variability or low predictability, 
at least in relation to himself. In contrast, the 
college Ss, while still deviating significantly 
from the normal distribution, sorted many 
more statements into the center piles of the 
scale, thus accepting more readily concepts of 
self as more variable and of lower predicta- 
bility. As indicated, the neurotics responded in 
the direction of the psychotics but did not 
differ significantly from the college group. 
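The information-theory reading offered above can be illustrated with Shannon's entropy measure. The sketch treats a sort as a distribution over the nine piles and computes its information in bits and its redundancy relative to the nine-pile maximum; both sorts are hypothetical, and the paper itself reports no such computation.

```python
import math


def entropy_bits(freqs):
    """Shannon entropy (in bits) of a frequency distribution."""
    n = sum(freqs)
    return -sum((f / n) * math.log2(f / n) for f in freqs if f > 0)


def redundancy(freqs):
    """1 - H / H_max, where H_max is the entropy of equal use of all piles."""
    return 1.0 - entropy_bits(freqs) / math.log2(len(freqs))


# Hypothetical sorts of 100 items into nine piles.
u_shaped = [30, 10, 5, 3, 4, 3, 5, 10, 30]    # items judged very true or very untrue
spread = [8, 10, 12, 13, 14, 13, 12, 10, 8]   # broad use of the middle piles

for name, sort in (("U-shaped", u_shaped), ("spread", spread)):
    print(name, round(entropy_bits(sort), 2), round(redundancy(sort), 2))
```

A sort concentrated in the two extreme piles carries less information per item and more redundancy than one spread across the scale, which is the sense of "low information and high redundancy" in the passage above.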

The results suggest distinct advantages inherent
in the “free-sort” procedure employed in
this study. In general, the use of this procedure
seems to preserve the great amount of information
that is now lost in forced distribution
methods. The possibility of differences in shape
of distribution between groups of varying
degrees of maladjustment suggests that meaningful
personality correlates of shape of distribution
might be determined. As noted
earlier, two characteristic shapes of curve seem
to emerge on inspection, one skewed to the 
right and one skewed to the left. These dis- 
tributions may be thought of as representing, 
respectively, primarily repressive and primarily 
sensitized orientations. In the one case, Ss have 
tended systematically to deny the self-reference 
of the statements; in the other, they have 
tended to perceive the statements predomi- 
nantly highly applicable to themselves. Once 
the stability of the shapes of “free-sort”
distributions is known, the significance of
temporal changes in shape of distribution under
various personality or therapeutic conditions
might be profitably studied. It would seem,
then, that “free-sort” procedures, through
their property of retaining information, would
lead to an extension and continued development
of Q technique and related
correlation-between-persons methods.


SUMMARY 


Current Q-sort methodology, through its 
characteristic use of forced, quasi-normal dis- 
tributions, appears implicitly to support the 
theoretical position that the characteristic 
shape of distribution of traits within indi- 
viduals is a quasi-normal one and that there 
are not significant differences in shape of 
distribution of traits among individuals. It was 
the purpose of the present study to assess 
experimentally this theoretical formulation. 
Three groups of Ss, college, neurotic, and psy- 
chotic, were administered the SIO Q sort with 
instructions to sort the items on an equal- 
appearing-intervals scale with the same range 
as that customarily used but with an unspecified
or “free” distribution rather than a forced,
quasi-normal distribution.

The results showed that each S’s “free-sort”
distribution varied significantly from the ap- 
proximate normal distribution customarily im- 
posed in use of the SIO Q sort and that within 
each of the groups, Ss’ shapes of distributions 
varied significantly from each other. Composite 
curves were constructed for each of the three 
groups. An analysis of variance over the three 
curves failed to show significant differences. 
Separate F tests comparing each curve with 
each of the others showed significant dissimilar- 
ity only between the curves for the college and 
for the psychotic groups, with the psychotic 
group showing the greater deviation from the 
approximate normal distribution. The inter- 
dependence of the three F tests, however, 
raises some question as to the actual significance
of the finding.

It was concluded that current forced-dis- 
tribution procedures result in a significant loss 
of information which may be retained by use
of “free-sort” procedures of the type described 
here. The possibility of differences in shape of 
distribution between groups of varying degrees 
of maladjustment suggests that meaningful 
personality correlates of distribution shape 
might profitably be investigated. 


MOTIVATION AS A VARIABLE IN WORK-PARTNER SELECTION¹


ELIZABETH G. FRENCH 


Personnel Research Laboratory, Air Force Personnel and Training Research Center


THE hypothesis tested in the present
experiment is that the behavior of an
individual making a work-partner choice
between a competent nonfriend and a less 
competent friend can be predicted from 
knowledge of his goal orientation. Specifically 
it was hypothesized that Ss with high achieve- 
ment motivation would select the competent 
nonfriend; those with high affiliation motiva- 
tion, the less competent friend. The Ss high 
in both motivations were expected to show 
evidence of conflict. 


METHOD 


In a group session, friendship ratings and achieve- 
ment- and affiliation-motivation scores were obtained 
from the Ss and the importance of doing well on a test 
of “concept formation” was stressed. Then the Ss were 
divided into groups of four on the basis of the friendship 
ratings. Each group was made up of three mutual
friends and an S who was not considered a friend by the
other three. The “concept formation test” was explained
to each group in turn and the four Ss worked on
it individually. The nonfriend (labeled SS) was permitted
to succeed; the other three were made to fail.
On the expiration of the time limit, the Ss were told
that they were to work on a similar test in pairs and
asked to write down their choice of a work partner. The
choices of the three unsuccessful Ss comprised the experimental
data.

Subjects. The Ss were airmen in their seventh week 
of basic training at Lackland Air Force Base. A total of 
204 men, members of seven flights, took part. The 
data are based on 137, as explained below.

Friendship ratings. Twenty-eight or 32 members of 
each flight reported as a group. They were told that
they were to take part in an investigation of factors
related to formation of friendships in the Air Force.
Each man was given a roster containing the names of
all the men present and asked to rate them 1, 2, or 3
according to whether he liked the man in question and
considered him a friend (1), didn’t know him very well
or was indifferent to him (2), or disliked him (3). In addition
they checked their three best friends and any
men they particularly disliked. The flight is the training 
group into which the men are organized as soon as they 
arrive on the base. They live, train, study, and work 


¹ This investigation was carried out under the Air
Force Personnel and Training Research Center in support
of Project No. 7704. Permission is granted for
reproduction, translation, publication, and use or disposal
in whole or in part by or for the United States
Government. Portions of this paper were read at the
1955 meeting of the American Psychological Association.


together in virtual isolation from other groups during 
the training period. It is reasonable to assume that by 
the end of seven weeks patterns of friendship would be 
stable and meaningful. 

Motivation measurement. After the Ss had completed 
the friendship ratings, they were told that the second 
purpose of the investigation was to determine the rela- 
tive ability level of men entering the Air Force at differ- 
ent times. It was explained to them that it was very 
important for them to do as well as possible on the 
“concept formation” test they were soon to take. This 
was done to provide an achievement orientation similar 
to the affiliation orientation presumably resulting from 
the making of the friendship ratings. Then the motivation
measurement device was presented as a task unrelated
to either of the main purposes of the experiment.
This instrument has been described in detail elsewhere 
(1, 2).² Called a “Test of Insight,” it is presented to the
S as a measure of his ability to understand the behavior
of others. The test items are single-sentence descriptions
of behavior supposedly characteristic of an individual
with whom S can identify. (Example: “Tom always
lets the ‘other fellow’ win.”) The S is to “explain” the
behavior. The number of achievement-relevant and
affiliation-relevant responses make up the scores.

Composition of work groups. While the Ss were
taking the Test of Insight, the friendship ratings were
examined and each S was assigned to a work group
with the following considerations in mind. Preliminary 
investigation had revealed that if best friends—Ss who 
rated each other 1 with a check—were put in the same 
group they invariably chose each other. Also if the 
nonfriend were either strongly disliked by the other 
work-group members, or very unpopular in the flight as 
a whole, he was never chosen. The artificially created 
necessity to succeed on the task could not be made as 
strong as the more extreme likes and dislikes. Therefore 
best friends were not grouped together nor was the 
SS for any group one whom any other member had 
checked as especially disliked. Secondly, approximately
half the Ss failed to use the 3 category on the friendship
scale at all. In these cases their 2 ratings were considered
to indicate lack of friendship and the SS for
their groups were drawn from those Ss so rated. Finally,
the SS was selected to have an Armed Forces Qualification
Test score equal to or higher than the other
members of the group in order to give plausibility to the
success and failure manipulations. Each group, then,
was composed of three Ss who had rated each other 1
without checking the rating, and one S with a high
AFQT score who was rated 3 (or 2) by the other three
Ss. In 16 of the 51 groups it was possible to satisfy the
criteria for only two of the three critical Ss. The data
for the third Ss in these groups, of course, were not
used.

² Copies of a paper describing the test and its development
and directions for scoring may be obtained
from the author.
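
The subject counts reported under Subjects and Composition of work groups can be cross-checked with a few lines of arithmetic. The sketch below assumes (the text does not say so explicitly) that all 204 men were placed in four-man groups and that exactly one critical S was dropped in each of the 16 groups that could not meet the criteria; the constant names are illustrative only.

```python
# Figures taken from the text; the grouping assumptions are stated above.
TOTAL_MEN = 204               # airmen who took part
GROUP_SIZE = 4                # three critical Ss plus one SS per group
CRITICAL_PER_GROUP = 3        # Ss whose partner choices were analyzed
GROUPS_WITH_ONE_DROPPED = 16  # groups where only two critical Ss qualified

n_groups = TOTAL_MEN // GROUP_SIZE                         # work groups formed
max_critical = n_groups * CRITICAL_PER_GROUP               # critical Ss if none dropped
usable_critical = max_critical - GROUPS_WITH_ONE_DROPPED   # Ss contributing data

print(n_groups, max_critical, usable_critical)
```

Under these assumptions the totals agree with the 51 groups and the 137 Ss on whom the data are based.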

The experimental situation. The experimental room
was equipped with four individual booths so arranged 
that the Ss could not see each other but all could see E 
and a blackboard. The four Ss were seated in the booths 
and E wrote each S’s name on the board. The E then 
reminded the Ss of the importance of succeeding on the 
task and gave them their instructions. The task in- 
volved sorting 30 cards containing various combinations 
of straight and curved lines, segments of geometric 
figures, and complete figures into categories. There was 
no logically compelling solution and almost any sort 
could be called “correct.” The Ss, however, were told
that there was only one correct answer. The instruc- 
tions were: 

“These cards contain designs which can be sorted 
into a definite and specific number of categories. Your 
task is to decide what the categories are and arrange 
the cards accordingly. There is only one right answer 
but you may have more than one trial. When you think 
you have the solution raise your hand and I will check 
it. You can offer as many solutions as you like until the 
time is up. The time limit will be five minutes after the 
first man gets it right.” 

The first solution offered by the SS was accepted as 
correct, his success was announced, and his name was 
circled on the blackboard. All solutions offered by the 
other Ss were called incorrect. Five minutes after the 
SS had finished, time was called. The E then explained
that the Ss were to work on a similar task in pairs and 
asked each man to write down the name of the man 
with whom he would like to work. In order to permit 
an expression of conflict, the Ss were told that if there 
were two men with whom they would like to work 
equally well they might write down both names. 

The Ss were paired and worked on a second set of 
cards but the critical part of the experiment ended with 
the choice of a partner.


RESULTS AND DISCUSSION 


The achievement-motivation scores ranged 
from 0 to 15 with a mean of 5.65 and an SD 
of 2.98. The median fell between 5 and 6. For 
affiliation motivation the range was 0 to 15 
with a mean of 4.55, an SD of 2.48, and the 
median between 4 and 5. The Ss were divided 
into four motivation groups on the basis of 
these scores: those with achievement-motiva- 
tion scores above the median and affiliation 
below (High Achievement); those with 
affiliation above the median and achievement 
below (High Affiliation); those with both 
scores above the median (Both High); and
those with both scores below (Both Low).
Since the two sets of scores are relatively 
independent (r = .17) the number falling 
into each group (Table 1) is approximately 
equal. 
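
The median split described above can be sketched in a few lines. This is an illustration only: the half-point cutoffs are an assumption that follows from the medians falling between two integer scores, and the function name and example scores are hypothetical, not taken from the study.

```python
from collections import Counter

# The reported medians fall between adjacent integer scores, so a half-point
# cutoff (an assumption of this sketch) separates "above" from "below".
ACH_CUT = 5.5  # achievement median lies between 5 and 6
AFF_CUT = 4.5  # affiliation median lies between 4 and 5


def motivation_group(achievement: int, affiliation: int) -> str:
    """Assign an S to one of the four motivation groups by median split."""
    high_ach = achievement > ACH_CUT
    high_aff = affiliation > AFF_CUT
    if high_ach and high_aff:
        return "Both High"
    if high_ach:
        return "High Achievement"
    if high_aff:
        return "High Affiliation"
    return "Both Low"


# Hypothetical (achievement, affiliation) score pairs, not data from the study.
example_scores = [(9, 2), (3, 8), (7, 6), (2, 1)]
groups = [motivation_group(a, f) for a, f in example_scores]
print(Counter(groups))
```

Because the two scores are nearly uncorrelated, a split of this kind would be expected to fill the four cells about equally, as the text reports.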

TABLE 1
FREQUENCY OF CHOICES IN EACH CATEGORY BY MOTIVATION GROUP

Choice category                      High Achievement
Success-subject only (SS)            12
Success-subject, then friend (SF)    7
Friend, then success-subject (FS)
Two friends (FF)
One friend (F)

[Entries for the remaining rows and for the other motivation groups are illegible in the source.]

The choices were tabulated in five categories:
Choice of the SS only; choice of the
SS first, then a friend (SF); choice of a friend
and then the SS (FS); choice of two friends
(FF); and choice of a single friend (F). The 
two orders of choice of SS and friend were 
separated on the assumption that the first- 
listed man might be the preferred one even 
though the instructions were to list two only 
if they were equally preferred. To the extent
that this might be true the SF order was 
expected to be related to relatively higher 
achievement and lower affiliation motivation 
than the FS order. 

Inspection of Table 1 reveals large differ- 
ences in the frequency with which the various 
kinds of choices were made. These frequencies 
are significantly different from chance ex- 
pectancies and well beyond the .001 level 
(chi square is 55.68). The large number of F 
choices suggests that in spite of the attempt to 
decrease the importance of the friendship 
variable in setting up the groups, friendship 
remained more important than task success. 
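
The comparison of observed choice frequencies with chance expectancies is a chi-square goodness-of-fit test. The sketch below assumes that "chance" means equal expected frequencies in every category, which the text does not state; the function name is hypothetical and the counts are purely illustrative, not the frequencies of Table 1.

```python
def chi_square_uniform(observed):
    """Chi-square statistic and degrees of freedom against equal expected counts."""
    total = sum(observed)
    expected = total / len(observed)  # same expectation for every category
    statistic = sum((o - expected) ** 2 / expected for o in observed)
    df = len(observed) - 1
    return statistic, df


# Hypothetical counts for illustration only; they are not data from the study.
hypothetical_counts = [10, 20, 30, 40]
stat, df = chi_square_uniform(hypothetical_counts)
print(f"chi square = {stat:.2f}, df = {df}")
```

Applied to the five choice categories of the study, the same function would have 4 degrees of freedom.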

Even with the widely divergent frequencies 
in the choice categories, the choices of the 
four motivation groups conformed to predic- 
tion (Table 1). One third of the High Achieve- 
ment group chose the SS and over half made 
SS or SF choices. This group made two thirds 
of the total SS choices and over half of the 
SF choices. More than half of the Both High 
group made either SF or FS choices. These 
Ss accounted for half the FS choices. The High 
Affiliation group made no SS or SF choices 
and only four FS. All their other choices were 
either FF or F. This group made over half 
the FF choices and appears to be the only one 
showing much evidence of conflict over choosing
between two friends. The Both Low Ss
made a few choices in the first four categories,
roughly proportioned to the marginal totals,
but by far the largest number made F choices.
This, too, is in accordance with expectation
when the over-all tendency to F choices is
considered. A chi-square test applied to this
table yields a value of 61.71. The values of
chi square for the individual motivation
groups compared with the over-all frequencies
are 21.87 for the High Achievement group,
11.70 for the Both High, 16.96 for the High
Affiliation, and 11.18 for the Both Low. The
significance levels are .001, .02, .01 and .05
respectively.

TABLE 2
MEAN MOTIVATION SCORES BY CHOICE GROUPS

Columns: Choice Group; Achievement Motivation (Mean, SD); Affiliation Motivation (Mean, SD)
Rows: Success-subject only; Success-subject, then friend; Friend, then success-subject; Two friends; One friend

[Surviving entries: Success-subject only, 8.67; Success-subject, then friend, 7.67; Two friends, 6.42; 4.14 (row uncertain). Column assignments and the remaining entries are illegible in the source.]

Table 2 presents the mean motivation scores 
of the Ss grouped according to choices made. 
The order of magnitude of both sets of means 
is as expected—achievement-motivation means 
decrease from the SS group to the FF and 
affiliation-motivation means increase. That
the affiliation-motivation mean is not higher
for the F group suggests that the F choice is a
result of the characteristics of the situation 
rather than motivation of the individual. 
Analyses of variance of the two sets of motivation
scores yielded F ratios of 14.52 (p < .001)
for achievement and 4.74 (p = .05) for affiliation.
The t tests computed on the differences
between the individual means (not including 
the F group) revealed that although only one 
difference between adjacent means was sig- 
nificant, all four of the differences between 
means one-step-removed were significant at 
the .02 level or better and those between the 
extremes were significant beyond the .001 
level. 

These results suggest that scores on a test 
of motivation such as the one described pro- 
vide a basis for predicting behavior in a choice 
situation. Apparently Ss with high achievement
motivation are willing to overlook the
fact that they don’t particularly like another 
man if they think that man can help in the 
achievement of a goal. Conversely, those high 
in affiliation motivation don’t care as much 
about goal attainment as interpersonal rela- 
tions, and so choose their friends. In this 
situation, in fact, they generally chose both 
friends. The Ss with both motivations high 
were unable to make a choice and those with 
both low followed the line of least resistance. 
Although the experiment was not designed 
with any analysis of the initial friendship 
ratings in mind, such an analysis was sug- 
gested by the great variability in the use of the 
3, or dislike, category by the Ss. Apparently 
some Ss were unwilling to express dislike or 
hostility even when the recipient would remain 
unaware of it. This suggested the tentative 
hypothesis that these Ss would be high in 
affiliation motivation, possibly low in achieve- 
ment, and would tend to choose friends rather 
than the SS in the choice situation. Those Ss 
who did use the dislike category might be 
expected to have lower affiliation motivation, 
possibly high achievement motivation, and be 
willing to reject friends and choose the SS. 
As the fact that half the Ss made no 3 ratings 
on the friendship scale suggests, the distribu- 
tion of friendship ratings was badly skewed. 
The Ss were therefore divided into two groups,
those who made no dislike ratings and those
who made one or more, and the number of 
each falling into the four motivation cate- 
gories was determined. A trend in the pre- 
dicted direction was apparent, with the High 
Achievement group having the greatest num- 
ber of Ss who gave 3’s and the fewest Ss who 
did not, and the High Affiliation group the 
reverse. A chi square over the whole table 
did not reach the .05 level but a comparison 
of the two extreme groups yielded a chi square 
of 3.85 (p = .05). The breakdown of the users 
of 3’s vs. nonusers by choice-groups was also 
according to prediction with a chi square of 
10.29, significant at the .05 level. 


SUMMARY AND CONCLUSIONS 


The present study was designed to test the 
hypothesis that the behavior of an individual 
making a choice between a work partner who 
was a competent nonfriend and one who was a 
less competent friend can be predicted from 
knowledge of the relative strength of achievement
and affiliation motivation. The Ss first
made friendship ratings and took a test de- 
signed to measure both achievement and 
affiliation motivation. Then they were divided 
into groups of four to work on an “important 
concept formation test.” The groups were so
constructed that three of the Ss, the critical 
Ss, had rated each other as liked and the 
fourth as not liked. They all worked individ- 
ually on a sorting task at which the disliked 
S was made to succeed and the others to fail. 
Then all Ss wrote down either one first or a 
first and a second choice for a work partner. 

Chi-square analyses of the four motivation 
combination groups indicated that, according 
to prediction: 

1. The Ss high in achievement and low in 
affiliation motivation made significantly more 
single choices of the success-person. 

2. The Ss high in affiliation and low in
achievement made significantly more choices
of friend and the success-person.

3. The Ss high in both motivations made 
significantly more double choices involving 
both a friend and the success-person. 

4. The Ss low in both showed no patterning. 

In addition, the achievement-motivation
mean scores showed a significant increase and
the affiliation a significant decrease from Ss
choosing the friends through Ss choosing both
friend and success-person to Ss choosing the
success-person only.


EFFECTS OF DIFFERENT CONDITIONS OF ACCEPTANCE UPON
CONFORMITY TO GROUP NORMS¹


JAMES E. DITTES 


Yale University


AMONG the variables influencing a person’s
conformity to a group’s norms,
two interrelated factors are likely to
be important: (a) the extent to which he is
attracted to or values his membership in the
group; and (b) the extent to which he feels
that other members are attracted to or value 
him. With respect to the first, it has been 
demonstrated that the more highly a person 
is attracted to a group, the more he conforms 
to the face-to-face pressures operating within 
it (1, 3) and the more he resists counternorm 
communications from outside it (6). The ex- 
periment reported here attempts to hold con- 
stant this first variable and to determine the 
effects on conformity of the second. 

A relationship of mutual dependence is pre- 
supposed between the individual and group. 
Just as the individual is dependent on other 
members (a relationship emphasized in re- 
search on attraction to or valuation of the 
group), they also, to some degree, depend 
upon him—his skills, knowledge, and general 
ability to contribute to group life. Through 
their behavior they can be expected to com- 
municate the value they place on him (to be 
referred to as their acceptance of him) and 
their desire to have him continue as a member. 
The present research varies this information 
concerning a person’s acceptance in a group 
and investigates the effects on his conformity 
to its norms.

The effect of either variable on conformity 
is probably mediated by various intervening 
acquired motives. We would assume that in our 
culture, conformity to group standards is 
generally learned as a means of satisfying 


1 This experiment was conducted as part of a re- 
search program on communication and opinion change 
operating at Yale University on a Rockefeller Founda- 
tion grant and directed by Carl I. Hovland. The 
authors gratefully acknowledge the excellent coopera- 
tion of the members of the Yale Class of 1958 who 
served as subjects. Members of the 1954-55 senior 
seminar on the psychology of the small group provided 
expert assistance in the conduct of the experiment. The 
authors are also indebted to Arthur R. Cohen, Carl I. 
Hovland, and Ben Willerman for their suggestions on 
the manuscript. 
many acquired motives, activation of which
results, given appropriate conditions, in con- 
formity behavior. The range of motives 
involved seems to include at least two fairly 
distinct clusters, one consisting of “approach”
motives such as tendencies to help the group 
and to pattern one’s self after admired persons, 
the other of anxiety-based motives such as 
desire to avoid criticism and 
membership. 

It is probable that valuation of membership 
affects conformity through both kinds of mo- 
tives, but that feelings of acceptance affect 
the person’s sense of security about remaining 
in the group, which in turn affects primarily 
those motives based on anxiety. Assuming 
valuation constant, information communi- 
cated from fellow members that a person is 
little accepted by them increases his sense of 
insecurity, activating various acquired motives 
(to avoid social criticism, etc.), to which 
conformity behavior has been learned. Persons 
informed that they are well accepted in a 
group feel secure in their membership and 
have little anxiety-based motivation to con- 
form. 

In the present investigation, we are inter- 
ested in two kinds of such information: (a) 
information about how much the person is 
accepted and (b) information about how stable
the acceptance evaluations are. Either item 
would be expected to influence insecurity. 
Because of practical limitations, it was not 
possible to carry out a factorial design per- 
mitting evaluation of the separate effects and 
interaction of these two variables. It was 
decided that an investigation of their joint 
effects would provide a better initial test of 
the fruitfulness of our analysis than a study 
of either variable alone. Throughout the report,
the label “acceptance” applies to both
these components. 

The hypothesis we propose to test is: Among 
persons who attach equal importance to their 
membership in a group, those who receive 
information that they are only minimally 
accepted by their colleagues and that this 
evaluation is subject to change, possibly 
becoming worse, conform more than persons
who receive information that they are highly 
accepted and that this situation is stable. 

A similar hypothesis was tested by Kelley 
and Shapiro (5) who found it supported by 
correlational evidence that nonconformity was
associated with high actual popularity or
acceptance. But the hypothesis was not supported
by the experimental manipulations of
fictitious information about acceptance. It 
was suggested that this relationship was 
obscured because valuation of membership 
was not held constant. They found that in- 
formation that one is poorly accepted affects 
not only his feelings of acceptance but also his 
subsequent valuation of the group. As our 
analysis suggests, a decline in valuation should 
decrease the “approach” motives which 
prompt conformity (e.g., desire to aid the 
group) thus offsetting any increase in other 
conformity-prompting motives resulting from 
the insecurity derived from feeling poorly 
accepted. 

The present experiment was designed to 
avoid this obscuring effect in two ways: (a) 
It was attempted, with partial success, to 
keep valuation uniform by motivating all Ss 
so highly to participate in the group that their 
valuation of membership would remain constantly
high throughout the experiment. (b)
Several degrees of acceptance were introduced, 
instead of two, with the expectation that even 
if valuation varied significantly between the 
extreme conditions of acceptance, it might 
remain uniform between more similar condi- 
tions, particularly with the conditions of 
relatively high acceptance. In this case the 
hypothesis could still be tested by comparing 
the conformity among persons in those degrees 
of acceptance for which valuation remained 
relatively uniform. 


METHOD 


Subjects. One hundred and three volunteers from the 
Yale freshman class met in 18 five- and six-man groups. 
Members of each group had no prior acquaintance with 
one another. 

Incentives to motivate Ss to value and participate in 
their group, and instructions about possible rejection. At 
the outset, each S agreed to work with his group in a 
contest against the other experimental groups. A cash 
prize and prestigeful recognition were offered as awards 
for the group best in efficiency, smoothness of working 
together, and soundness of decisions. These instructions 
carried the strong implication that unanimity in group 
decisions was highly desirable. 


It was then announced that to guarantee an effec- 
tively working group, members would have the option 
of eliminating from their group any person who ap- 
peared detrimental to its success. For this purpose, 
subsequent tasks were to be interrupted periodically 
while each member anonymously rated each other one 
on this question: 

How desirable is it that this person be kept in the
group?

___ extremely desirable
___ very desirable
___ somewhat desirable
___ not very desirable, but he should be kept in
___ he should be rejected from the group

It was made to appear likely that some Ss would re- 
ceive low ratings and that this would be unpleasant 
and penalizing for them: they would have to undergo 
the embarrassment of defending themselves in an 
open discussion of whether they should be rejected; 
rejection would carry the implication that the person 
was inadequate in his “social adaptability.” (While the 
ratings were actually collected during the subsequent 
period, no S was ever discussed for rejection.) 

Introduction of “delinquent gangs” problem and 
achievement of unanimous decision; interruptions to 
obtain ratings of others. In each group, an initial dis- 
cussion concerned the relative worthiness of two gangs 
of juvenile delinquents, described in detailed simulated 
court records. A difference in the records insured that 
the same gang would be judged better in all groups, 
but this difference was slight enough that the decision 
could emerge only after considerable discussion. To 
heighten the tendency to conformity, the problem was 
described as being much like a jury’s in that solution 
required the agreement of all. During the discussion, 
an observer kept a tally of each man’s participation and 
whether his comments were for or against the gang 
ultimately favored by the group 

After general agreement was reached in free dis- 
cussion, the norm was crystallized and registered by 
having the group rate the two gangs on several scales 
such that one gang was assigned labels such as “very 
deserving” and “fundamentally good’’; and the other 
gang was characterized as “vicious,” “‘malicious,” etc. 
During this process E refused to record any rating
until it was concurred in by every member. After this 
public agreement, Ss privately rated the gangs on 
different scales, to provide a measure of their initial 
acceptance of the group’s characterization of the two 
gangs. 

The discussion of the gangs was interrupted three 
times to permit the members to rate one another as to 
the desirability of their remaining in the group. Each 
time, E quickly thumbed through the ratings, pretended 
that no S had received low enough ratings to warrant 
discussing his possible rejection, and “discarded” the 
ratings into a wastebasket. 

Introduction of different conditions of acceptance. At 
the end of the “gangs” discussion, E suggested a rest
period. While Ss were relaxing, he casually inquired 
whether they’d be interested in seeing how they had 
been rated. Before anyone could object, E retrieved 
the slips from the wastebasket and distributed them. 
In this manner, each S was allowed to see privately 
what he thought to be the ratings made of him by the 
other members. These were in fact fictitious ratings
prepared in advance and substituted for the originals
in the wastebasket. So that Ss would similarly interpret
these ratings, E announced that he had noticed the
average to be around “very desirable.” In each group,
one person found his ratings to be mostly higher than
this average (high condition); two received ratings
mostly at this average (average condition); two had
ratings slightly below average (low condition); and one
received ratings well below (very low condition). (In 5-man
groups, only one of the intermediate conditions
was represented by two persons.) Information about the
probable stability of the ratings was provided by E’s
statement, as though from experience with ratings in
other groups, that the higher the rating the less likely
it was to change; low ratings were quite likely to
change, possibly becoming higher or even lower. It was
intended that as a result of this information, a person
in the high condition would feel highly accepted, and
that this was a stable situation. At the other extreme,
a person in the very low condition was supposed to
feel very little acceptance and that his colleagues’
evaluations might change, even becoming worse.

Before each session the four conditions of acceptance
were randomly assigned to positions around the discussion
table. The 103 Ss were distributed among the
conditions as follows: 18 in high, 33 in average, 34 in
low, and 18 in very low. The intermediate conditions
were assigned more Ss because greater variability of
behavior was anticipated in them.
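
The reported distribution of Ss over conditions can be checked against the group sizes. The sketch below assumes only what the text states (103 Ss in 18 five- and six-man groups, one high and one very low S per group, and two Ss in both intermediate conditions of six-man groups but in only one of them in five-man groups); the function and variable names are hypothetical.

```python
def solve_group_sizes(n_subjects, n_groups, small=5, large=6):
    """Return (n_small, n_large) so that the groups account for every subject."""
    extra, remainder = divmod(n_subjects - small * n_groups, large - small)
    if remainder != 0 or not 0 <= extra <= n_groups:
        raise ValueError("no mix of group sizes fits these totals")
    return n_groups - extra, extra


n_five, n_six = solve_group_sizes(103, 18)

# Condition totals as reported in the text.
reported = {"high": 18, "average": 33, "low": 34, "very low": 18}

# Every group holds exactly one "high" and one "very low" S.
assert reported["high"] == n_five + n_six
assert reported["very low"] == n_five + n_six

# Six-man groups hold two Ss in each intermediate condition; five-man groups
# hold one in each, plus one extra S in either "average" or "low".
base_intermediate = 2 * n_six + n_five
extra_average = reported["average"] - base_intermediate
extra_low = reported["low"] - base_intermediate
assert extra_average + extra_low == n_five

print(f"{n_five} five-man and {n_six} six-man groups")
print(f"five-man groups doubling 'average': {extra_average}, doubling 'low': {extra_low}")
```

On these assumptions the reported totals are internally consistent: they imply thirteen six-man and five five-man groups.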

Additional information about the gangs, private
judgment, and public discussion. After the fictitious
ratings had been introduced and interpreted, the group
proceeded to the second phase of the “gangs” problem.
The purpose of this phase was to introduce pressure to
deviate from the group consensus about the gangs and
to determine each person’s subsequent adherence to the
norm,

uted in his further private ratings of the 
the new information was assumed to represent con-
formity with the group’s original norm (these scores are 
reported as Gang Index 1). The Ss also made additional 
ratings of the gangs after reading this information, one 
set anonymously and one set supposedly for public 
comparison (Gang Indices 2 and 3 respectively). To 
obtain an indication of each S’s speed of responding to 
the contradictory information, the observer noted the 
order in which Ss finished making their evaluations and 
ratings.

Following the ratings, the group openly discussed
the gangs for a short period, during which
observations were made comparable to those made during the
initial discussion of the gangs.

Administration of number-judgment problem. A second
situation for assessing conformity was next introduced,
using a task requiring simple comparative judgments
of numerosity (judging which of two squares contained
more dots), the same problem used by Kelley and
Shapiro (5). The problem was presented to Ss working
privately, in a series of eleven pairs of squares of
decreasing difficulty, in which the correct answer was
the same for the entire series. The Ss were to share
ideas and try to improve their group score by exchanging
written messages between successive judgments.
In fact, their messages were not delivered, but were
intercepted and replaced by a standard set of messages
which led each S to believe that all the other members
of his group had decided that a given square, A, was the
correct answer for the series. To heighten the pressure
to conform to this consensus, the rule was imposed
that the group would score points on any one of the
eleven successive judgments only when the group was
unanimously correct.


Pressure to deviate from the consensus [...] increasingly
clear evidence [...]. On each pair, S reported his
“private opinion” (and degree of confidence in it) and also a
“public vote” to be tallied toward the group score. These
were summed over the last ten judgments (the ones made
following receipt of messages indicating the group consensus),
yielding Dot Index 1 for the private opinions (weighted
by confidence) and Dot Index 2 for the public votes.
It will be noted that these conformity indices may reflect
two aspects of conformity: (a) initial acceptance
of the apparent consensus and (b) continued adherence
to it in the face of the contradictory evidence. Actually,
there is little variability in conformity on the early
judgments (the general level being high), so these
indices reflect primarily the second aspect.

Subsequent measures. (a) The Ss’ perceptions of
experimental conditions of acceptance were measured
by a direct question immediately after they saw the
ratings and at the end of the experiment, by recall of the
ratings they had received. (b) Valuation of membership
in the group: Immediately after receiving the ratings
and also at the end of the experiment Ss answered
short questionnaires containing items measuring their
positive motivation to remain in their particular
group, e.g., desire to be invited to further meetings,
liking to work with the other members. (c) The Ss’
interpretations of conditions of acceptance: On a
questionnaire given several months after the experiment,
Ss indicated how likely they considered it to be
that they might be rejected, and the [...]
judgment.
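
The scoring of the two dot-judgment indices described earlier in this section can be illustrated with a brief sketch. It is only one plausible reading of the text: the confidence scale, the treatment of nonconforming private opinions (scored here as zero), and the names dot_indices and CONSENSUS are assumptions made for illustration, not details given in the report.

```python
from typing import List, Tuple

CONSENSUS = "A"  # the square named in the standard messages (assumed label)


def dot_indices(judgments: List[Tuple[str, int, str]]) -> Tuple[int, int]:
    """Return (Dot Index 1, Dot Index 2) for one S.

    Each judgment is (private_choice, confidence, public_vote).
    Only the last ten judgments are scored, as in the text.
    Assumption: a private opinion that departs from the consensus
    contributes zero, whatever its confidence rating.
    """
    scored = judgments[-10:]
    # Dot Index 1: private conformity, weighted by confidence
    index1 = sum(conf for private, conf, _ in scored if private == CONSENSUS)
    # Dot Index 2: number of public votes cast for the consensus
    index2 = sum(1 for _, _, public in scored if public == CONSENSUS)
    return index1, index2
```

Under this reading an S who votes publicly with the group on every pair while privately abandoning the consensus on the later pairs receives a high Dot Index 2 and a lower Dot Index 1.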

Termination of experimental session. At the end of
each session, the purpose and procedures of the experiment
were disclosed and the Ss were told that the
ratings they had seen were fictitious. These disclosures
were greeted with laughter, expressions of relief, and
even a certain amount of disbelief. At no time did any
S indicate that he had clearly doubted the genuineness of
the ratings he had seen. Special care was taken to
reinvolve as active and obviously accepted group
members those Ss who had been in the conditions of
low and very low acceptance. The Ss were requested to
maintain secrecy during the remainder of the weeks
scheduled for the experiment. Their excellent cooperation
in this provides one of several indications of the
favorable attitudes with which they left the experiment.


RESULTS 
Direct Effects of Experimental Conditions 


Perceived acceptance in group. The fictitious 
ratings clearly produced the intended differ- 
ences among the experimental conditions in 
amount of perceived acceptance. The question,
“From the point of view of the group,
how desirable is it that you, yourself, be kept
in?” (“extremely desirable” scored as 5 and
“I should be rejected from the group” scored
as 1), yielded average scores of 4.7, 3.8, 3.3,
and 2.9, respectively for the high, average, 
low, and very low conditions. Analysis of 
variance reveals that the between-condition 
variance is significant at beyond the .001 level 
of confidence. Furthermore, at the end of the 
experiment, Ss were able to recall their ratings 
accurately. The recalled ratings did not differ 
significantly from those actually received. 

Perceived likelihood of being rejected. Several 
questions dealing with perceived likelihood of 
being asked to leave the group and with pre- 
occupation about this possibility serve largely 
to differentiate the very low condition from 
the other three, the very low Ss viewing rejection
as more likely. The very lows differ
significantly (p < .01) from each of the other 
conditions which do not differ significantly 
among themselves. These questions were 
asked only in the questionnaire given some 
months after the experiment so the results 
must be viewed with some reservations. 

Valuation of membership. The efforts to 
keep all Ss highly attracted to the group were 
only partially successful. The high and average 
conditions showed the highest and approxi- 
mately equal levels of attraction while the 
levels for the low and very low conditions were 
markedly lower, especially the latter. This 
effect was the same whether measured imme- 
diately after the ratings were distributed or 
at the end of the experiment. The total valua- 
tion scores yielded a between-condition vari- 
ance which is significant at beyond the .001 
level of confidence. Table 1 indicates the 
different valuation scores and the statistical 
significance of the differences between adja- 
cent conditions. 

Because valuation of membership was kept
uniform only for high and average conditions,
major interest in the subsequent results will
be in comparisons of these two conditions.
In these, we may expect there to be little
interaction of valuation with our two main
variables, acceptance and conformity.


TABLE 1
VALUATION OF GROUP MEMBERSHIP BY FOUR
EXPERIMENTAL CONDITIONS OF ACCEPTANCE

Condition of acceptance | High | Average | Low | Very Low
Average valuation of membership in group | 40 [...]
Significance of difference between adjacent conditions | High vs. Average: p > .50 | Average vs. Low: p < .0001 | Low vs. Very Low: p < .0001


Effects on Participation 


The Ss in the very low condition reduced 
their participation in the discussion of the 
delinquent gangs by about 50 per cent after 
seeing the ratings (p < .01 for the change from 
the prior amount of participation). In contrast, 
average acceptance tended to result in increased 
participation, the difference between before 
and after being significant at the .09 level of 
confidence. The amount of participation after 
the ratings were received, expressed as a per- 
centage of the amount of prior participation, 
is as follows: 93 per cent, 131 per cent, 118 per 
cent, and 56 per cent, respectively for high, 
average, low, and very low. 


Effects on Conformity 


Conformity in responding to questionnaires.
Average values of the conformity indices
derived from the questionnaires used with the
two tasks are presented in Table 2, high positive
scores indicating high conformity, negative
scores low conformity. The gang-judgment
indices were adjusted to take account of
individual differences in initial conformity to
the group decisions and intergroup differences
in conformity behavior.² The three separate
gang indices were summed to give an over-all
gang-judgment score. Also the five different
measures were combined into an over-all
conformity score by counting for each S the
number of his scores on the five measures
which were above the average of his experimental
group. The average of these numbers
for each experimental condition is presented
in the last line of Table 2. Because the theoretical
interest centers on the high and average
conditions by virtue of their having the highest
and approximately equal levels of valuation
of membership, p values are given in Table 2
for differences between them. Since the two
low conditions show the same tendency to be
lower than the average, p values are also
presented for the differences between their
combined mean and that of the average.

² The scores on initial acceptance of the group
decisions on the gangs were obtained before the experimental
manipulations of acceptance and, as might be
expected, an analysis of them reveals no differences
approaching significance among the Ss who later found
themselves in the different conditions. In order to
eliminate these initial individual differences from the
measures of subsequent conformity, each S’s score on
initial acceptance was subtracted from each of the
three later measures of opinion. Analysis of variance
of each of these difference scores showed that the 18
experimental groups differed significantly in mean
level of conformity and also in variability. These
effects presumably reflect slightly different definitions
of the norm and differing pressures to conform which
developed during different experimental sessions. To
eliminate these differences, on each measure of the
gang judgment each S was given a standard score
representing the amount of his conformity with reference
to the mean and standard deviation of his particular
experimental group. As noted in the text, the
dot-judgment scores, based only on behavior occurring
after the experimental manipulations, reflect both
initial acceptance of the norms and resistance to contrary
evidence, though largely the latter. It was not
necessary to transform them to standard scores because
they did not vary significantly among experimental
sessions with regard either to means or variances.


TABLE 2
AVERAGE CONFORMITY SCORES FOR THE FOUR
EXPERIMENTAL CONDITIONS OF ACCEPTANCE*

Conformity index | High | Average | Low | Very Low | p values (High vs. Average; Average vs. Low and Very Low)
Gang judgments:
1 | -.22 | .07 | .10 | -.09 |
2 | -.41 | .32 | -.10 | .09 | <.02; .10
3 | -.31 | .20 | -.07 | .04 | <.10
Over-all gang | -.92 | .57 | -.05 | .02 | <.05
Dot judgments:
1 | 39.0 | 43.4 | 33.2 | 36.4 | .02
2 | 7.7 | 8.2 | 7.2 | 6.7 | <.05
Over-all Conformity | 2.8 | 3.2 | 2.5 | 2.6 | .06; <.05

* Positive values indicate high conformity; negative values,
low conformity. The significance of the difference between conditions
on the over-all conformity index was tested with chi square,
dividing those who were above their group mean on a majority
of the five indices from the remainder. A t test was used to test the
significance of the other differences noted.
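
The two adjustments described for the conformity measures (standard scores within each experimental group, and the over-all count of indices above the group mean) may be easier to follow in a short sketch. The function names are hypothetical, and the use of the population standard deviation is an assumption, since the report does not say which form was used.

```python
from statistics import mean, pstdev
from typing import Dict, List


def standard_scores(diff_scores: List[float]) -> List[float]:
    """Standard score of each S's difference score within one group.

    diff_scores holds, for every S in a single experimental group,
    the later opinion measure minus his initial acceptance score.
    Assumption: the population standard deviation is used.
    """
    m = mean(diff_scores)
    sd = pstdev(diff_scores)
    if sd == 0:
        return [0.0 for _ in diff_scores]
    return [(x - m) / sd for x in diff_scores]


def overall_conformity(group: Dict[str, List[float]]) -> List[int]:
    """For each S, count the indices on which he exceeds his group's mean.

    group maps an index name to the scores of all Ss in one
    experimental group, listed in the same S order for every index.
    """
    n_subjects = len(next(iter(group.values())))
    counts = [0] * n_subjects
    for scores in group.values():
        m = mean(scores)
        for i, x in enumerate(scores):
            if x > m:
                counts[i] += 1
    return counts
```

With five indices the count for each S runs from 0 to 5, which matches the range of the averages shown in the last line of Table 2.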

In general, Ss receiving ratings of average 
acceptability tended to show more conformity 
behavior than did Ss receiving ratings of high 
acceptability. The results with low and very 
low conditions are less uniform, possibly for 
reasons connected with their rather low valua- 
tion of membership. They showed less con- 
formity than the average and more than the 
high condition on gang judgments, and less 
than either of the other two on dot judgments. 

The three indices of conformity on gang 
judgments were found to be significantly 
intercorrelated, as were the two indices on dot 
judgments. Significant correlations between 
gang and dot measures were found only for 
the high and average conditions and these had 
values of around .50. Despite this low degree 
of consistency between the two types of con- 
formity behavior, Table 2 shows that the 
difference in conformity between high and 
average conditions of acceptance is consistent 
over the two somewhat different tasks. 

A less direct indication of conformity gives 
results supporting those of Table 2. An ob- 
server noted the order in which the Ss at each 
session completed the task of evaluating the 
additional information that suggested that the 
group’s decision on the gangs was inaccurate. 
It might be assumed that persons oriented 
most strongly toward conforming would give 
less attention to the new information and make 
quick judgments simply on the basis of the pre- 
viously established norm; and that persons less 
motivated to conform would give more con- 
scientious attention to the information and 
hence complete the task more slowly. The aver- 
age rank order of finishing the task is as follows: 
4.2, 2.7, 3.8, and 3.1, for high, average, low, and 
very low, respectively. The condition of average 
acceptance was fastest, indicating, on the basis 
of the above assumptions, most conformity; 
the high condition was slowest indicating least 
conformity. The difference between these two 
is significant at the .05 level of confidence, 
using a chi-square analysis that divided Ss into 
slowest and fastest halves. 
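
A chi-square analysis of a two-by-two split of this kind can be sketched as follows. The cell counts are hypothetical (only the row totals of 18 and 33 correspond to the sizes of the high and average conditions), and the uncorrected Pearson formula is an assumption, since the report does not state whether a continuity correction was applied.

```python
from typing import Sequence


def chi_square_2x2(table: Sequence[Sequence[int]]) -> float:
    """Pearson chi-square (no continuity correction) for a 2x2 table."""
    (a, b), (c, d) = table
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column needs at least one case")
    return n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)


# Hypothetical counts, not the study's data.
# Rows: high, average condition. Columns: fastest half, slowest half.
example = [[5, 13], [22, 11]]
statistic = chi_square_2x2(example)
# 3.841 is the .05 critical value of chi-square with 1 degree of freedom
significant_at_05 = statistic > 3.841
```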

Conformity in open discussion. In Table 2, 
the measures intended to tap private opinions 
(Gang Index 1 and 2; Dot Index 1) and those 
supposed to reflect public expression (Gang
Index 3 and Dot Index 2) reveal essentially 
similar trends over the four conditions of ac- 
ceptance. However, as compared with the 
quasi-public nature of the latter measures, 
analysis of actual public expressions of opinion 
shows a somewhat different pattern of con- 
formity from that noted in Table 2. (This find- 
ing suggests that the questionnaire results 
mainly reflect private conformity.) In the dis- 
cussion of the gangs after the additional infor- 
mation had been read, there was a general 
decrease in the proportion of comments that 
expressed conformity to the group’s earlier 
preference for one gang over the other. (Before 
the information, 88 per cent of the comments 
involving opinion were favorable to the group 
norm; afterward, only 49 per cent were so.) 
This decrease was less for the very low condi- 
tion than for the others; i.e., in their actual 
public remarks, the very lows conformed more 
closely to the original norm than any of the 
other conditions. The scores³ for the high, average,
low, and very low conditions, respectively,
were: —.55, —.47, —.52, and —.23, with the 
greater negative number indicating the greater 
deviation from the norm. Chi-square analysis, 
dividing Ss at the over-all median into high 
and low changers, shows that the difference 
between very low and the other conditions 
taken together is significant at the .08 level. 
The difference between any other condition 
and the rest does not approach significance. 

This result receives partial confirmation 
from responses to a question about how free 
the S would have felt to express an opinion con- 
trary to the group opinion. Very low Ss report 
more hesitancy in this respect than do High Ss 
(p = .07) and the other conditions report inter- 
mediate degrees. 
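
The score used for conformity in open discussion is defined in the accompanying footnote as a difference of proportions of pronorm and antinorm remarks. A minimal sketch of that arithmetic follows; the function and argument names are hypothetical.

```python
def discussion_change_score(pro_before: float, anti_before: float,
                            pro_after: float, anti_after: float) -> float:
    """Change in conformity of one S's public remarks.

    Each argument is a proportion of the S's total remarks in the
    earlier ("before") or later ("after") discussion of the gangs.
    Negative values mean the S moved away from the group norm.
    """
    for p in (pro_before, anti_before, pro_after, anti_after):
        if not 0.0 <= p <= 1.0:
            raise ValueError("proportions must lie between 0 and 1")
    # later pronorm - earlier pronorm + earlier antinorm - later antinorm
    return (pro_after - pro_before) + (anti_before - anti_after)
```

For example, an S whose pronorm remarks fall from .75 to .25 of his total while his antinorm remarks rise from zero to .25 receives a score of -.75.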


DISCUSSION 


To simplify the reader’s task of assimilating
the results and trends reported above, they
are summarized verbally in Table 3. We now
consider the interpretation of these findings.
In several respects, they are consistent with
the hypothesized relationship between acceptance
and conformity. However, the total pattern
of results suggests that the relationship
depends on more complex intervening variables
than had been anticipated.

³ The score for each S represents the proportion of
his total remarks in the later discussion which were
pronorm, minus the proportion of his total remarks in
the earlier discussion which were pronorm, plus the
proportion of antinorm remarks in the earlier discussion,
minus the proportion of antinorm remarks in the later
discussion.

One set of data consistent with the hypothe- 
sis is provided by the comparison of the high 
and average conditions. Since the relation be- 
tween acceptance and conformity is likely to be 
obscured if valuation varies, as previously dis- 
cussed, the hypothesis can be clearly tested 
only by comparing conditions having fairly 
similar levels of valuation—in this experiment, 
the high and average conditions. Between these 
two, higher conformity behavior was exhibited 
by the condition of lower and less stable accept- 
ance. The available evidence does not indicate, 
however, any difference between high and 
average Ss in feelings of insecurity about their 
membership. If these groups actually do not 
differ in this respect, the higher conformity of 
the average Ss must be explained on some other 
basis than the one underlying our hypothesis: 
that they would be more motivated to avoid 
rejection. Their total pattern of behavior
(high conformity on the questionnaires, rapid
reactions to counternorm information, and
heightened participation in the discussion after
learning of their acceptance) is perhaps more
suggestive of a desire to improve their social
standing. The S in the average condition places
considerable value upon the group, but at the
same time, is not completely accepted in it.
Since E’s comments indicate there is some possibility
that his acceptance status may change,
we might expect him to try to attain complete
acceptance. Eager participation and uncritical
conformity to the norms would presumably
facilitate this improvement in status. That this
conformity extends even to the expression of
private opinions on the questionnaire is understandable
if we consider that there may be a
general tendency for upward-mobility-oriented
persons to identify with and take over the
values of higher status individuals (7).


TABLE 3
DIGEST OF THE RESULTS OF THE FOUR EXPERIMENTAL
CONDITIONS OF ACCEPTANCE

Variable | High | Average | Low | Very Low
Perceived acceptance | High | Average | Low | Very Low
Perceived likelihood of being rejected | | | | Higher than any other condition
Valuation of membership | Moderate | Moderate | Low | Very Low
Participation in group discussion | | Highest | | Lowest
Speed of reacting to counternorm information | | Faster than any other condition | |
Conformity in questionnaire responses (private?) | Low | High | Low | Low
Conformity in public statements | | | | Higher than any other condition
Felt freedom to express deviant opinion | Highest | | | Lowest

The behavior exhibited by the very lows is
much more consistent with our initial notion
of anxiety-motivated conformity. In general,
and in accord with prior findings (4, 5), group
membership loses its attractiveness for persons
little valued by their colleagues.⁴ Along
with this loss goes a decline in tendency to accept
the group’s norm, at least in quasi-private
settings. In the extreme case, however, where
acceptance is so low that actual rejection is
presumably an imminent possibility, anxiety
about rejection is especially high and the result
seems to be a pattern of guarded public
behavior, i.e., the very lows withdrew from
open participation in the discussion and showed
the highest amount of conformity in the opinions
they did publicly voice. Avoidance of rejection,
which is presumably motivated by
penalties particularly attached to rejection
(embarrassment, suffering failure in front of
the E), is of course a variable different from
the positive motivation to participate in the
group; so anxiety over rejection may have been
high for the very lows, even though they had
little positive motivation to stay in the group.
(The questionnaire some months after the experiment
indicated that the desire to be kept
in the group and not be rejected had been as
high for the very lows as for the other conditions,
while at the same time, their estimate
of the likelihood of this event had been higher.)
Their public conformity, then, may reflect a
relatively high motivation to avoid rejection.
That they exhibit a high public conformity but
little private acceptance is consistent with
Festinger’s hypothesis that this particular pattern
of conformity results when the attractiveness
of group membership is low, but people
are constrained to stay in the group by external
threats or barriers against leaving (2).

⁴ This raises interesting theoretical questions which
cannot be discussed here: Is acceptance by others one
of the prerequisites for desiring membership in a group?
Why does valuation not decline with the change from
high to average conditions, but drops sharply for the
lower degrees of acceptance? Is this a level-of-aspiration
effect, in which the motivation to stay in the group
declines only as the possibility of success in this effort
drops below some critical value?

In brief, our results suggest two processes 
linking conditions of acceptance to conformity. 
For persons who value a group, are less than 
completely accepted in it, but have some possi- 
bility of achieving complete acceptance, con- 
formity facilitates such a gain in acceptance. 
Their conformity is unquestioning and extends 
to private opinions as well as to public be- 
havior. On the other hand, persons who are on 
the brink of unwelcome rejection manifest con- 
formity only at the public level, presumably as 
a means of forestalling such rejection. 

The major variable in this experiment— 
security and degree of acceptance—is, from 
several points of view, an aspect of what is 
commonly called “status.” We might therefore
expect our results to shed some light on the 
problem of the relationship between status and 
conformity. In general, we doubt that this re- 
lationship is ever a simple one. Our results and 
interpretations suggest that to predict con- 
formity from status, clarity is first needed as 
to the conditions under which conformity is to 
be observed, whether they involve surveillance
by other members or relative privacy. Then, it 
must be asked whether the observed differ- 
ences in status are associated with differences 
in (a) valuation of membership, (b) security in
membership, and (c) motivation to improve 
one’s acceptance. Under some conditions, as 
when low status is associated with relatively 
high insecurity about membership, it seems 
likely that status is inversely related to public 
conformity. But if there is a marked difference 
between high and low status persons in their 
valuation of membership, as a result of greater 
privileges and satisfactions being associated 
with higher status, status and conformity may 
be correlated positively. 

Undoubtedly, a number of other factors also 
affect this relationship. For example, it is easy 
to imagine instances where special motives 
directed toward conformity would be operative
for high status persons, but not for low
status members. One such case would be that 
where the opinions of a high status member 
have carried inordinate weight in the setting 
of group standards so that they happen to co- 
incide with his private preferences. What ap- 
pears as conformity in his behavior may actu- 
ally be motivated by private considerations 
that preceded the acceptance of his behavior as 
the norm. Another instance may be when spe- 
cial conformity demands are made of high 
status persons, for example, because of the special
symbolic value their conformity may possess.

These considerations indicate that many fac- 
tors affect the relation between status and con- 
formity. In some instances, the relationship 
may be direct; in others, inverse; and in still
others, perhaps, nonexistent. We believe the 
specific situations and results will be most in- 
telligible when analyzed in terms of factors such 
as those suggested above. 


SUMMARY 


In experimental groups of Ss, two different 
norms were developed: one concerning a social 
value judgment; the other, a simple percep- 
tual judgment. The Ss were experimentally 
made to feel different degrees of being accepted 
by the other members and were then given 
opportunities and incentives to deviate from 
the norms. Subsequent conformity, participa- 
tion, and attitudes toward the group were stud- 
ied in relation to the different conditions of 
acceptance. 

The results point to two contrasting pat- 
terns of conformity evoked by different condi- 
tions of acceptance. The first appears to con- 
sist of a high degree of genuine adherence to the 
norms, as indicated by unquestioning con- 
formity extending even to conditions of pri- 
vacy, and a higher-than-average motivation to 
participate in group discussion. This pattern 
appeared for the experimental condition in 
which subjects enjoyed somewhat less than 
complete acceptance but probably saw the possibility
of gaining this status. It is interpreted
as based on strong positive attachment to the 
group and motivation to improve one’s status 
therein. 

The second pattern is marked by high con- 
formity only under public conditions. It oc- 
curred for the experimental condition of lowest 
acceptance, in which Ss saw total rejection as 
being a likely possibility. The interpretation 
is made that although Ss in this condition have 
lost much of their positive motivation to con- 
form to group standards (indicated by their 
very low valuation of membership and reflected 
in their low private conformity), they may 
nevertheless be concerned about the negative 
consequences accompanying rejection (at least, 
under the conditions represented in this experi- 
ment). Public conformity is seen as a way of 
forestalling this unpleasant eventuality. 
VERBAL REINFORCEMENT AND INTERVIEWER BIAS¹


DONALD C. HILDUM AND ROGER W. BROWN


Harvard University 


IN SEVERAL recent studies verbal behavior
has been manipulated by means of selective
reinforcement. These studies have
very disturbing implications for the clinical
and public-opinion interview. In the initial 
experiment Greenspoon (2) asked college stu- 
dents to voice nouns ad libitum. For one group 
of Ss, E murmured “Mm-hmm” whenever a
plural noun was produced, while for a control 
group he said nothing at all. The reinforced Ss 
used more plural nouns than the control Ss. 
A somewhat different technique was developed 
by Taffel (4) and also employed by Cohen 
et al. (1). They required S to form a sentence 
when given a verb and a choice of one of six 
personal pronouns to be used in starting the 
sentence. It proved to be possible to affect the 
frequency with which S used the various 
pronouns by saying “Good” in a flat unemotional
tone after sentences employing the
desired pronoun subject. Finally, Greenspoon 
has reported (3) the successful use of two 
nonverbal stimuli (a red light and a 190-c.p.s. 
tone) to increase the frequency of plural and 
also of nonplural responses. In all of these 
studies S said that he had not been conscious
of any connection between his own behavior 
and the reactions of E. This is learning without 
awareness. 

In these experiments reinforcement seems 
to act like the moon on the tides with in- 
evitable and uncontingent effect. For those 
interested in such things there appears a glitter- 
ing new prospect for human manipulation; for 
others only the quiet pleasure to be found in 
any new proof of human stupidity. The verbal- 
reinforcement data will confirm many in their 
suspicion of the methods of clinical and social 
psychology. The therapist who believes in the 
importance of the Oedipus complex could 
elicit Oedipal content by means of selective 
reinforcement. Perhaps a patient could even 
be brought to an appearance of mental health 
through the encouragement of “healthy” ut- 
terances. Is not “Mm-hmm” the very hallmark 
of a therapeutic school? In an opinion interview
the S might be infected with the opinions of
the interviewer by means of patterned reinforcement.
Perhaps verbal reinforcement is a
mechanism of interviewer bias.

¹ This study was supported by the Laboratory of
Social Relations, Harvard University.

It can be answered to these alarms that 
neither interviewers nor therapists wish to 
influence their Ss and consequently they 
would not use verbal reinforcement. There 
is no comfort in this answer. Anyone who has 
worked with verbal reinforcement knows 
that our “Mm-hmm” and “Good” are 
ordinarily unconscious and automatic. They 
can only be controlled after some practice. It 
follows that an interviewer might influence 
his S in all innocence and we know from the 
experiments that an S might unwittingly 
accept the influence. Here is powerful social 
influence operating outside the awareness of 
everybody concerned. 

Two important differences between the 
laboratory studies and actual interviews 
should however be noted: (a) Greenspoon and 
the other experimenters have taken as their 
response units particular words or word types. 
Interviewers are not ordinarily pleased either 
by plural nouns or by the use of the first 
person singular. If they are moved to agree or 
disagree it is likely to be with reference to a 
line of thought or an expressed attitude. In 
short, interviewers would be likely to reinforce 
content categories rather than specific verbalizations.
(b) In a face-to-face conversation
there are many ways of communicating agreement
or disagreement: smiles, nods, averted
eyes. In the laboratory studies of verbal 
reinforcement E sometimes placed himself 
behind the S and sometimes faced him but 
endeavored to avoid giving visual cues. These 
efforts at isolating the independent variable 
may not have succeeded. In any case they 
must have created a rather strange social 
atmosphere unlike the usual interviewing 
situation. 

We undertook an experiment intended to 
correct these two laboratory artificialities. 
The experiment took the form of an attitude 
survey conducted by telephone. The interviewer
asked questions concerning the Harvard
philosophy of General Education. He revealed
a bias for or against the philosophy by his
reactions to the answers of the interviewee.
In effect, he played the role of one who favored
or of one who opposed General Education.
However, the role-play was limited to the
single matter of verbal reinforcement. The S
could discover E’s bias from his first few reactions
and might then provide more of the
kind of content that was reinforced. Administering
the questionnaire by telephone created
a situation in which vocal reaction was the
only kind possible and so the independent
variable was isolated in a natural way.


METHOD 


The questionnaire. In order to maximize the likelihood
of obtaining a verbal reinforcement effect, a topic
(General Education) was selected on which Ss were not
expected to have very strong opinions and so were
expected to be open to influence. Also, though E might
be expected to have an opinion, there was no obvious
stereotype to tell S what that opinion should be. In the
questionnaire there were 15 questions, with 4 possible
responses to each: Agree strongly, agree slightly,
disagree slightly, disagree strongly. The statements were
worded so that agreement with some statements represented
an attitude favorable to General Education
while agreement with other statements represented an
unfavorable attitude. Consequently the interviewer
reinforced an attitude rather than a particular response.
To make sure that the favorable or unfavorable nature
of each statement was clear, we originally wrote 40
statements intended to sound unlike but really covering
much the same ground. From these we selected 15
which on pretest proved to constitute a maximally
redundant subset. It is generally possible to predict
from the answer to one of these questions the answers
to all the others. In addition the questionnaire was
prefaced with a brief statement defining the philosophy
of General Education so as to make sure that the S
would understand which statements favored the
philosophy and which were opposed.

Subjects. Forty male students, graduate and undergraduate, were
sampled from those in the Harvard Summer School who listed Cambridge
telephone numbers and home addresses in the United States. Ten Ss were
assigned at random to each of four experimental groups.

Procedure. The same E conducted all the interviews. 
He identified himself as a member of the Social Rela- 
tions Department and said that the survey was de- 
signed to compare summer-school opinion with that of 
the rest of the student body. He then administered the 
questionnaire. 

For two groups the reinforcement was “Mm-hmm,” pronounced after
pro-General Education answers for one group, and after anti answers
for the other group. For the remaining two groups the reinforcement
was “Good,” again with one group rewarded for favorable responses and
the other for unfavorable responses. Both reactions were fed back
immediately following an approved answer in a neutrally-toned, rising
inflection. The E (a trained linguist) carefully equated the
intonation used in reading questions and responding to Ss in all
groups.

At the close of the interview S was asked to guess E’s opinion on
General Education and also to say whether he thought there was any
bias in the questionnaire or in its administration. If S thought there
was bias or felt that he knew E’s opinion he was then asked whether
his answers were influenced by these conditions.

RESULTS 

The reinforcement effect. Each answer was 
scored from 1 to 4: 1 when an answer was 
strongly in favor of General Education, 2 
when slightly favorable, 3 when slightly un- 
favorable, and 4 when strongly unfavorable. 
Refusals to answer (12 of 600 responses) were 
scored 2.5. The total score for an S had a 
possible range from 15 to 60. The actual range 
for all Ss was from 15 to 39.5. The Ss tended 
to favor the General Education philosophy. 
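
The scoring rule just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the use of `None` for a refusal, and the boolean key marking whether agreement with a statement favors General Education are all hypothetical conveniences, not part of the original study materials.

```python
from typing import Optional, Sequence

# The four response alternatives named in the questionnaire description.
RESPONSES = ("agree strongly", "agree slightly",
             "disagree slightly", "disagree strongly")
REFUSAL_SCORE = 2.5  # refusals to answer were scored 2.5


def score_answer(response: Optional[str], agreement_favors_ge: bool) -> float:
    """Score one answer: 1 = strongly favorable ... 4 = strongly unfavorable.

    `agreement_favors_ge` is a hypothetical item key: True when agreeing
    with the statement expresses a favorable attitude.
    """
    if response is None:  # assumption: None stands for a refusal
        return REFUSAL_SCORE
    position = RESPONSES.index(response.strip().lower()) + 1  # 1..4
    # For statements worded against General Education the scale is reversed.
    return float(position if agreement_favors_ge else 5 - position)


def total_score(responses: Sequence[Optional[str]],
                keys: Sequence[bool]) -> float:
    """Sum the item scores for one S (possible range 15 to 60 for 15 items)."""
    if len(responses) != len(keys):
        raise ValueError("one key is needed per response")
    return sum(score_answer(r, k) for r, k in zip(responses, keys))
```

With 15 items, an S who answers every item in the strongly favorable direction totals 15 and one who answers every item in the strongly unfavorable direction totals 60, which matches the possible range stated above.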

In Table 1 appear the mean attitude scores for the four experimental
groups and in Table 2 are the t scores and probabilities for the
crucial comparisons among these means. Reinforcement with “Good” was
effective while reinforcement with “Mm-hmm” was not effective. In
fact, with “Mm-hmm” the obtained difference was in the opposite
direction to that predicted from the reinforcement scheme.


TABLE 1
MEAN ATTITUDE SCORES FOR FOUR EXPERIMENTAL GROUPS

Group               Mean     SD
“Good”-pro          24.95    5.10
“Good”-anti         31.75    3.46
“Mm-hmm”-pro        29.8     3.96
“Mm-hmm”-anti       27.1     4.06


TABLE 2
t SCORES AND PROBABILITIES FOR CRUCIAL COMPARISONS AMONG THE FOUR
EXPERIMENTAL GROUPS

Comparison*                              t score    p
“Good”-pro vs. “Good”-anti               3.51       .01
“Mm-hmm”-anti vs. “Mm-hmm”-pro           1.43       .20
“Good”-pro vs. “Mm-hmm”-pro              2.25       [illegible]
“Mm-hmm”-anti vs. “Good”-anti            2.62       .02

* First member of comparison has score more favorable to
General Education.
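
A t score of this kind can be computed from group means and standard deviations alone. The sketch below shows one conventional way to do so; it assumes equal group sizes (10 Ss per group, as stated under Subjects) and assumes that each SD was computed with n in the denominator. Whether the authors used exactly this formula is not stated in the text, and the function and dictionary names are illustrative only.

```python
import math

# Means and SDs as given in Table 1; n = 10 Ss per group.
TABLE_1 = {
    "good_pro": (24.95, 5.10),
    "good_anti": (31.75, 3.46),
    "mmhmm_pro": (29.8, 3.96),
    "mmhmm_anti": (27.1, 4.06),
}


def t_from_summary(mean_a: float, sd_a: float,
                   mean_b: float, sd_b: float, n: int) -> float:
    """t for two independent groups of equal size n.

    Assumption: each SD uses n in its denominator, so the standard error
    of the difference between means is sqrt((sd_a^2 + sd_b^2) / (n - 1)).
    """
    standard_error = math.sqrt((sd_a ** 2 + sd_b ** 2) / (n - 1))
    return (mean_b - mean_a) / standard_error


def compare(first: str, second: str, n: int = 10) -> float:
    """t for two named groups; positive when `first` has the lower mean."""
    return t_from_summary(*TABLE_1[first], *TABLE_1[second], n)
```

For example, `compare("mmhmm_anti", "mmhmm_pro")` gives the t for the two “Mm-hmm” groups, with the more favorable (lower-scoring) group listed first as in Table 2.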
With the same pattern of reinforcement “Good” was more effective than
“Mm-hmm.”

Awareness of reinforcement. Eight of 20 Ss noticed that the
interviewer said “Good” but only 1 of these assumed that it indicated
approval. Only 1 of 20 Ss noticed that E said “Mm-hmm.” He thought it
might have meant approval. All Ss rejected the notion that their
answers had been influenced by the interviewer’s reactions.

Of 24 Ss who were willing to guess at E’s 
opinion all but one said that E favored General 
Education. Apparently they based this guess
on the assumption that someone surveying 
opinions on General Education would himself 
favor the principle, rather than on the pattern 
of reinforcement. 

Replication of the experiment. The essential design was repeated by
members of an undergraduate tutorial group with 25 Ss. The results
were like those of the initial study. The mean “Good”-pro score was
lower than the mean “Good”-anti score with p < .05, while the mean
scores for the two groups reinforced with “Mm-hmm” were not
significantly different. The mean “Good”-pro score was lower than the
mean “Mm-hmm”-pro score with p < .01. The mean “Good”-anti and
“Mm-hmm”-anti scores did not differ significantly. This study involved
new interviewers and Ss enrolled for the regular academic year at
Harvard rather than in summer school.


DISCUSSION 


In the present interview situation “Good” is a reinforcer while
“Mm-hmm” is not. This result conflicts with Greenspoon’s finding that
“Mm-hmm” affects the frequency of plural nouns. If conscious learning
were involved, an explanation for the discrepancy would be available.
When an S free-associates aloud for 25 minutes, he may be expected to
search rather desperately for some indication from E as to the purpose
of the task and the proper direction to take. In these circumstances
he will notice “Mm-hmm” or even a red light or a tone of 190 c.p.s.
and be guided by them. When S is asked to give his opinion of a set of
statements, he is engaged in a task that makes sense as presented, and
consequently he will not attend so closely to E’s reactions. In this
situation the muttered “Mm-hmm” should have no effect. The difficulty
is, of course, that Ss in our experiment and in Greenspoon’s nearly
all said they had been quite unaware of E’s “Mm-hmm.” In other words
the two sets of data give no indication of differential attention to
E’s reactions and so do not support our explanation. Still, the
explanation may be correct and our measures of awareness
insufficiently sensitive.

There are many questions one might ask to test S’s awareness. Does he
know the purpose of the experiment? Did he notice anything about E’s
behavior? Did he notice that E said “Mm-hmm”? Does he realize that
these “Mm-hmms” have influenced his behavior? All of the studies so
far find that Ss usually do not attain to this last level of
awareness. It is not clear from the experimental descriptions which
other questions were asked nor what the answers were. It is possible,
therefore, that reinforcers are more effective on the higher levels of
awareness than on the lower levels and that these levels have not been
adequately distinguished in the studies reported. We did ask our Ss
several questions before inquiring whether they knew about E’s
influence on their behavior. Eight Ss were aware that E had said
“Good” while only one S noticed “Mm-hmm.” Evidently “Good” is more
“visible” in the interviewer’s role than is “Mm-hmm.” Probably this is
because saying “Good” very nearly violates the prescribed nondirective
character of that role. This is a difference of awareness that may
help to account for the effectiveness in our study of “Good” and the
ineffectiveness of “Mm-hmm.” We also asked our Ss to guess at E’s
opinion of General Education. We thought it quite possible that while
S usually fails to notice E’s responses he might react to them by
forming a conception of E’s opinions and this conception might be
verbalizable when its behavioral sources were not. This measure
disappointed us in the present case, in that all but one S thought E
favored General Education. However, there is a tendency for the Ss who
were reinforced for anti-General Education answers to attribute to E a
somewhat less favorable attitude than do the Ss reinforced for pro-
responses. Furthermore the Ss whose conceptions of E’s attitude show
more sensitivity to the pattern of reinforcement received are also the
Ss whose own attitude scores seem to have been most influenced by E’s
reactions. While all of these trends fall short of significance in the
present study, they do open the possibility that levels of awareness
will help to account for differences in the effectiveness of
selectively interpolated experimenter reactions.

There is a further point of contrast between our results and those of
Greenspoon. Since a light and a tone have been used as reinforcers it
would seem that semantics plays no necessary part in this phenomenon.
Yet we find “Good” effective and “Mm-hmm” ineffective, and we are
inclined to think that this may be partly due to the fact that “Good”
has a more clearly favorable meaning than “Mm-hmm.” The meaning of
“Mm-hmm” is altogether dependent on the intonation pattern. As a
sequence of segmental phonemes, as a printed word, its meaning is
ambiguous. It may be a neutral indication that one is listening or it
may even be questioning or disapproving. “Good” is somewhat less
dependent on intonation. It is more reliably a favorable sign. This
semantic difference may help to account for the superior effectiveness
of “Good.”

Our experimental procedure has brought the 
verbal reinforcement experiment nearer the 
interview situation. It may eventually prove 
to be necessary to train interviewers to control 
their specific reactions to the content received 
from an informant. Some of these reactions 
(smiles, averted eyes, etc.) may not be sus- 
ceptible of control. It may be wiser, in a study 
of public opinion, to use interviewers of op- 
posed bias, letting their reinforcing reactions 
operate freely but hoping the effects will cancel
one another in the total sample. Pollsters might
even learn much about opinion stability by
having interviewers deliberately take different
sides for different Ss. 

In any case, there is much to be learned 
about the reinforcement of verbal behavior 
before worrying unduly about its effects in 
the interview. It has not yet been shown, for 
instance, that a reaction like “Mm-hmm” is
selectively used to express approval in ordinary
conversations or interviews. The reaction can
be selectively interpolated by an experimenter
but is it selectively interpolated outside the
laboratory? Can one predict the naive speaker’s
use of “Mm-hmm” from knowledge of his
attitudes? Even if this be possible, it does not
follow that the usage of either the clinician or
the opinion interviewer is similarly predictable.
Their efforts to play a nondirective role may
well change the ordinary usage of “Mm-hmm.”
Actual interviewer behavior needs to be 
studied before deciding on the importance of 
the studies of verbal reinforcement for the 
interview. 


SUMMARY 


A questionnaire was administered by telephone and the interviewer
attempted to influence his Ss through the selective interpolation of
two reactions, “Good” and “Mm-hmm.” “Good” proved to bias the
results obtained while “Mm-hmm” did not.
The study was repeated and the same result 
obtained. These results were compared with 
those obtained by other experiments. The 
implications of the verbal reinforcement phe- 
nomenon for the clinical and opinion interview 
are discussed. 
THE GENERALITY OF COGNITIVE COMPLEXITY IN THE
PERCEPTION OF PEOPLE AND INKBLOTS 
THEORIES of behavior that use perceptual or cognitive constructs have
found it necessary to postulate some organizing or schematizing
process which is held responsible for the active interpretation and
representation of external events to the organism. The ego of Freudian
theory is considered to mediate between the environment and the
organism, while Lewin speaks of the “functional firmness of the
boundaries between individual and environment” in discussing the
impact of the environment upon the individual (11, p. 107). Tolman
(14) discusses the conditions under which the “cognitive map” of the
organism operates as an intervening variable in behavior. Bartlett (1)
utilizes the concept of “schema” to refer to the organization of
previous experience which affects behavior in a current situation.
Piaget (12) discusses the developmental aspects of the assimilation
and differentiation of the environment. In a recent formulation, Kelly
(9) has utilized the notion of a “personal construct system” to help
explain the individual’s behavior in actively perceiving and
responding to the meaningful environment.


All such formulations imply two things about 
the role of perception in behavior. First, per- 
ception is an active process involving a 
transformation of sensory data into a con- 
ceptual scheme consistent with the previous 
learning and experience of the individual. 
Secondly, the conceptual scheme is structured 
differently from one individual to another, and 
an understanding of these structural differences 
is of value in predicting the behavior of the 
individual. Thus, in Freudian theory the ego 
has certain reality-testing functions which are 
affected by its stage of development and its 
characteristic defenses. Lewin speaks of the 
degree of differentiation of the inner personal 
regions of the life space and Tolman discusses 
the effects of broad and narrow cognitive maps 
on behavior. If we assume that every individual 
has a cognitive system or scheme for construing 
his environment, there remains the problem of how we characterize the
structure of this system so as to obtain meaningful predictions about
behavior.

1 This study was facilitated by a grant from the Laboratory of Social
Relations, Harvard University.

In an earlier paper (3), the position was taken that one way of
approaching the problem of cognitive structure was to posit a
dimension of complexity-simplicity. That is, an inference can be drawn
about the relative complexity of an individual’s cognitive system from
the measured complexity of his perceptions of external events.
Operationally, complexity is measured by the variability of responses
given to a finite realm of stimuli. Essentially, we are asking, in how
many different ways can a person perceive a certain set of events? The
assumption is made that the more differentiated the system is relative
to these events, i.e., the more alternative perceptions of an event
are available, then the more alternative behaviors will be available
in that situation.

An important question arises as to the generality of an individual’s
cognitive system from situation to situation. It seems evident that an
individual with a developmental pattern of experiences varying from
very intensive in some situations to little experience in others will
have learned to approach these different situations with varying
levels of cognitive complexity. However, it seems unwise to postulate
a capricious specificity to the individual’s cognitive behavior from
situation to situation. Rather, it is assumed that relatively
consistent, enduring modes of cognitive schematization will
characterize the individual’s behavior across situations. Genetically,
we assume that as the individual’s cognitive system develops in one
realm of experience, it will tend to generalize to some extent to new
realms of experience subsequently encountered by the individual.
Vernon (15) has recently discussed the functions of “schemata” in
different types of experimental perceptual situations. Vernon and
others have criticized experimental studies of form qualities,
constancy, and motivational factors as perhaps relying too much on
transitory, artificial states rather than upon states and situations
more readily encountered in everyday life.

In line with this criticism, and in view of the 
importance of the perception of people for 
personality theory, we would emphasize the 
primary importance of the individual’s cogni- 
tive system as it is organized about the 
perception of people in his life space. Thus, 
among the first objects in the environment 
with which the infant has intimate contact are 
people. As most personality theories stress, 
the period of infancy and early childhood is 
one in which important, perhaps basic, per- 
ceptions and differentiations are developed 
relative to the future course of the individual’s 
interpersonal experience. Might it not be then, 
that the structure of the cognitive system the 
child learns in relation to his perception of 
people is basic in determining the structure of 
the system relative to other stimuli and events 
in his life? Without being concerned now about 
the causal sequence involved, we can formulate 
the fundamental problem to be investigated in 
the present study: Can there be established a 
degree of generality between the complexity of 
the individual’s cognitive system in the per- 
ception of people and in the perception of 
events that fall outside the immediate realm 
of interpersonal experience? 

The measure of cognitive complexity in the 
perception of people that is used in this study 
has evolved from a series of attempts (3, 9) to 
express objectively the degree of differentiation 
an individual possesses in his perception of 
others. This measure will be described in 
detail in the section on method. The measure 
of complexity of cognitive behavior using non- 
human stimuli was derived from the behavior 
elicited on a modified Rorschach-test situation. 
Rorschach inkblots were used not only because
of the importance and relevance of this instrument
to clinical practice and research, but
also because it affords two kinds of behavior
which can be analyzed, namely, the determinants
utilized in giving a response and the
content of the response itself. In a manner to be 
discussed below, both the determinants given 
by an individual and the content given by an 
individual can be conceptualized as to their 
relative complexity. Again, it should be men- 
tioned that the basic operation for measure- 
ment of cognitive complexity is essentially 


response variability to a prescribed realm of 
stimuli. 

With these considerations in mind, we are 
now in a position to state the hypotheses to be 
tested: 

I. Significant positive relationships exist 
between the complexity of subjects’ percep- 
tions of people and the complexity of their 
perceptions of inkblots relative to the deter- 
minants used. 

II. Significant positive relationships exist 
between the complexity of subjects’ percep- 
tions of people and the complexity of their 
perceptions of inkblots relative to the content 
elicited. 


METHOD 


Subjects (Ss) in this study consisted of 40 male 
university undergraduates who were paid to serve in 
psychological experiments. 


Complexity Measures 


Perception of people. The Repertory Test (Rep Test) 
devised by Kelly (9) for evoking “personal constructs” 
was used to obtain the perceptions of others. This tech- 
nique consists of presenting S with the names of three 
persons known to him and asking S to sort the three in 
such a manner that two are perceived as alike or similar 
in some important personal way and different from the 
third in this respect. After S states the similarity of 
the two sorted together, he is asked the opposite of this 
perceived characteristic. Thus, S may sort two persons 
as similar to each other and different from the third 
because the two are “sincere,” and the opposite of this 
for S might be “insincere.” After the initial perception 
and its opposite are obtained, S is presented with three 
more persons and asked to sort them in an analogous 
fashion. The names of the persons sorted have been 
placed on cards by S according to a standard list of “role 
titles” supplied by the experimenter (E). In the present
study, the following role titles were used on the Rep 
Test: 

a. Yourself

b. Your brother closest to you in age (or person most like a brother)

c. Your closest girl friend

d. The most successful person whom you know personally

e. Someone you know personally whom you admire

f. Someone you know personally whom you would like to help or for
whom you feel sorry

These role titles were selected from a larger group of role
descriptions used in previous research as offering maximum
discrimination in terms of perceptions elicited. The six persons were
sorted by S in all possible combinations of three such that no three
names appeared together in a sort more than once. In all, 20 sorts
were thus obtained, with a perception of similarity and its opposite
being obtained on each sort. Essentially, then, S was confronted with
a task in which he was asked to perceive a fixed number of stimuli in
different combinations. The complexity of S’s perceptions of others
was measured in terms of the number of different perceptions (NDP) S
gave in the 20 sorts. If, in giving a perceived similarity and
difference on a sort, S gave the same verbal similarity or difference
which he had given on a previous sort, it was scored as a repeated
perception. It should be noted that a stringent criterion of response
repetition was used, i.e., S had to use the same word or words in his
perceived similarity or difference as used previously. Theoretically,
S could receive a maximum complexity or NDP score of 20 or a minimum
NDP score of one.
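
The sorting design and the NDP count lend themselves to a short sketch. The code below enumerates the 20 triads of six role titles and counts non-repeated perceptions. It rests on one reading of the scoring rule that should be treated as an assumption: a sort counts as repeated when either its similarity or its opposite uses exactly the same words (ignoring case and spacing) as any term given on an earlier sort. The function names and the letter codes for the role titles are illustrative.

```python
from itertools import combinations
from typing import List, Sequence, Tuple

# Letter codes for the six role titles used on the Rep Test.
ROLE_TITLES = ("a", "b", "c", "d", "e", "f")


def all_sorts(roles: Sequence[str] = ROLE_TITLES) -> List[Tuple[str, ...]]:
    """Every combination of three roles, each appearing exactly once."""
    return list(combinations(roles, 3))


def _normalize(term: str) -> str:
    """Compare terms on exact wording only, ignoring case and extra spaces."""
    return " ".join(term.lower().split())


def ndp_score(perceptions: Sequence[Tuple[str, str]]) -> int:
    """Number of different perceptions across the sorts.

    `perceptions` holds one (similarity, opposite) pair per sort.
    Assumed rule: a sort is a repeat if either term was already used.
    """
    seen = set()
    score = 0
    for similarity, opposite in perceptions:
        terms = {_normalize(similarity), _normalize(opposite)}
        if not terms & seen:  # neither term has been given before
            score += 1
        seen |= terms
    return score
```

Six roles taken three at a time yield 20 sorts, and under this rule the score runs from 1 (every sort after the first repeats an earlier term) to 20 (no term is ever reused), in agreement with the theoretical range given above.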

The NDP scoring was done by one of the authors. The reliability of
this scoring was obtained by selecting ten protocols at random and
having a judge score these records independently according to the
criteria outlined above. The interscorer reliability coefficient for
these ten cases was .92. In addition, an estimate of the stability of
the NDP measure was obtained by repeating the sorting procedure with
19 Ss after an interval of two weeks. The test-retest reliability
coefficient obtained was .82. The magnitude of this latter coefficient
indicates an adequate degree of stability in this measure of the
complexity of perceptions of others.

Perception of inkblots. Approximately two to three weeks after the Rep
Test was administered, each S was given the modified Rorschach
procedure. [...] have a constant series of stimuli [...] that would be
repeated so as to afford [...] for variable interpretations to be
evoked. Thus, both the Rorschach situation and the Rep Test contain
the crucial features necessary for our complexity measure.

The Rorschach modification used has been described in detail in
another paper (4). It consists of ten Rorschach large details (D), one
from each of the ten cards. All other portions of the blot are blocked
out by means of a template which exposes only the desired blot
portion. Five blots are chromatic and five are achromatic. The details
used were selected because of their judged ability to evoke a variety
of responses, in terms of content and in terms of determinants. The
notation (2) for the ten details, in order of administration, is:
Card I, (D 2); Card IV, (D 1); Card III, (D 2); [Card ...], (D 4);
Card VIII, (D 2); Card VI, (D 4); [Card ...], (D 1); Card VII, (D 9);
Card X, (D 1) [one of the illegible entries is Card IX].

The blots were presented to S using standard Rorschach instructions
[...] except for the following changes: [...] whole blot portion [...]
blot was removed as soon [...] S started to rotate the [...] in the
position in which [...]. When S had given one response [...] he was
told that he would be asked to go through [...] and this time he was
to tell what else each could look like. Following this second [...]
again presented to S with the [...] what else each [...]. After this
third series, an inquiry was conducted in the usual manner, card by
card. Thus, three responses were obtained from each blot, making a
total of 30 responses given by each S.

The Rorschach records were scored according to the usual Klopfer
scheme. Using ten randomly selected records, interscorer reliability
was found to average .85 for all scoring categories. The average
agreement for the ten records was 83 per cent. Because of the relative
infrequency of the various shading responses of the Klopfer type, all
such responses were scored as Sh. Thus, seven determinant categories
were scored, namely M, FM, Fm, F, Sh, FC, and CF. Because C responses
occurred in only two records, they were treated as CF responses for
our present purposes.

As mentioned previously, two general measures of complexity were
derived from the Rorschach, i.e., determinant complexity and content
complexity. Since no one basic measure of either determinant or
content complexity has been developed, several measures of each were
used. Determinant complexity was measured empirically in four ways by
counting: (a) the total number of different determinants used by S in
his 30 responses; (b) the total number of cards on which S repeated
the same determinant at least two times; (c) the total number of cards
on which S repeated the same determinant three times; (d) the total
number of cards on which S gave three different determinants. It is
apparent that high scores on a and d and low scores on b and c reflect
greater complexity. Content complexity was measured specifically in
two ways by counting: (a) the total number of responses which were
repeated by S on the same card; (b) the total number of cards on which
a response was repeated. Low scores on both the a and b content
measures reflect greater complexity. Interscorer reliability
coefficients on both of these content-complexity measures, using ten
randomly selected protocols, were .84 and .86 respectively. The
distributions of scores of the NDP and Rorschach complexity measures
are presented in Table 1.

RESULTS 


The two experimental hypotheses were tested by correlating each S’s
NDP score (complexity of perceptions of people) with his complexity
scores on the modified Rorschach. All complexity scores were found to
be approximately normally distributed (Table 1).


TABLE 1
DISTRIBUTIONS OF SCORES OF THE NDP AND RORSCHACH COMPLEXITY
MEASURES (N = 40)

Measure                    Mean     Median   SD      Range
NDP                        15.52    16.63    3.41
Determinant complexity
  a
  b
  c
  d
Content complexity
  a
  b

[Remaining entries illegible in the source; one surviving value, 3.50,
appears in the row for content measure b.]

* Includes one subject with extreme score of 26.
Using the Pearson product-moment coefficient, each of the six
Rorschach complexity measures was correlated with the NDP measure.
These findings are presented in Table 2.

It will be noted in Table 2 that Hypothesis I is supported by
significant correlations between the NDP measure and each of the four
Rorschach determinant-complexity measures. Similarly, Hypothesis II is
supported by the two significant correlations between the NDP measure
and the Rorschach content-complexity measures. It is apparent that the
various Rorschach measures of complexity are interrelated because of
the way in which they were empirically derived. This would explain the
relative homogeneity of the various correlations presented in Table 2.
Inspection of Table 2 indicates the Rorschach content measures of
complexity and the Rorschach determinant-complexity measures correlate
about equally with the NDP measure.


TABLE 2
INTERCORRELATION OF RORSCHACH-DETERMINANT AND CONTENT-COMPLEXITY
SCORES WITH COMPLEXITY SCORES IN THE PERCEPTION OF PEOPLE (N = 40)

Measure                                        Direction of    r             p*
                                               Prediction
Hypothesis I
  a. Number of different determinants          positive        [illegible]   [illegible]
  b. Number of cards with determinants
     repeated                                  negative        [illegible]   [illegible]
  c. Number of cards with three repeated
     determinants                              negative        [illegible]   [illegible]
  d. Number of cards with three different
     determinants                              positive        [illegible]   [illegible]
Hypothesis II
  a. Number of responses repeated              negative        [illegible]   .05
  b. Number of cards on which a response
     was repeated                              negative        [illegible]   .005

* One-tailed test.

Complexity of response to two stimulus realms, i.e., people and
inkblots, has been measured in this study by means of verbal behavior.
Consideration of the verbal measures used might suggest that a
variable such as verbal intelligence or fluency might account for much
of the interrelatedness of behavior on these two perceptual tasks. In
order to analyze this possibility, each S was given the vocabulary
subscale of the Wechsler-Bellevue (Form I). These vocabulary scores
were then correlated with the various Rorschach complexity measures
and with the NDP measure. It was found that there were indeed
significant correlations between four of the Rorschach complexity
measures and the vocabulary scores. The absolute range of these
relationships was from r = .37 to r = .46, the most pronounced
correlations being found between the content-complexity measures and
vocabulary. On the other hand, no correlation (r = −.01) was found
between the NDP measure and vocabulary scores. Thus, we may assume
that verbal facility, to the extent it is measured by vocabulary, is
not responsible for the intercorrelations obtained in testing the
experimental hypotheses. Indeed, when we partial out the effects of
vocabulary on the relationships between the complexity measures, we
find the correlations are increased, although not significantly.
Parenthetically, the fact that the Rorschach measures correlated
higher with the intelligence measure than did the NDP measure might be
related to the fact that just as every S had to define the same words
on the vocabulary test, every S perceived the same blot stimuli. On
the Rep Test, however, each S perceived persons different from those
perceived by other Ss.
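
The effect of partialling out vocabulary can be illustrated with the standard first-order partial correlation formula. In the sketch below the vocabulary correlations (.46 and −.01) are taken from the text, while the value .30 for the correlation between a Rorschach complexity measure and NDP is purely hypothetical, chosen only to show the direction of the change. The function name and argument names are likewise illustrative.

```python
import math


def partial_r(r_xy: float, r_xz: float, r_yz: float) -> float:
    """Correlation of x and y with z held constant (first-order partial)."""
    denominator = math.sqrt((1.0 - r_xz ** 2) * (1.0 - r_yz ** 2))
    return (r_xy - r_xz * r_yz) / denominator


# x = a Rorschach complexity measure, y = NDP, z = vocabulary score.
r_rorschach_ndp = 0.30      # hypothetical value, for illustration only
r_rorschach_vocab = 0.46    # upper end of the range reported in the text
r_ndp_vocab = -0.01         # reported NDP-vocabulary correlation

adjusted = partial_r(r_rorschach_ndp, r_rorschach_vocab, r_ndp_vocab)
print(round(adjusted, 3))
```

Because vocabulary is nearly uncorrelated with NDP, the numerator changes very little while the denominator falls below one, so the partial correlation comes out somewhat larger than the original value. This is the pattern the authors describe when they say the correlations are increased.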

Do any of the traditional Rorschach deter- 
minant categories bear any relationship to the 
Ss’ ability to achieve complexity of response 
in their perceptions of people? In terms of 
standard Rorschach rationale, it might be 
expected that the M or human movement 
response would be related to complexity of the 
perception of people. In the first place, M is a 
perception in which a person is the content. 
Secondly, a common interpretation given the 
M response is that it involves more inter- 
nalized, introversive, and possibly imaginative 
ways of approaching problems concerning the 
self and other. The results of correlating the 
various Rorschach determinants with the 
NDP measure are in line with this view. The 
correlation between M and NDP was .31 
(p = .05, two-tailed test). Further, this was 
the only determinant which correlated with 
the NDP measure at a level significantly different
from zero, the other correlations ranging
from .04 to —.06. Thus, it would appear that 
the greater the tendency to perceive humans 
in inkblots the greater is the tendency to 
invoke more variable or complex perceptions 
of people on the sorting task. 
DISCUSSION


The results of this study suggest that some 
degree of generality in the complexity of Ss’ 
behavior can be demonstrated using two per- 
ceptual tasks involving personal and non- 
personal stimuli. It is assumed that this 
generality of behavior is referable to the 
complexity of the cognitive system of the 
individual. That is, the manner in which an 
individual structures and cognizes one realm 
of events bears some relationship to how he 
structures another realm of events. It must be 
emphasized, however, that this is not a chance 
sampling of events. In the present study, the 
underlying formulation has been that the 
individual’s learning experiences in the realm 
of interpersonal relationships provide the 
basic core from which his cognitive system for 
construing the world is developed. The fact 
that there is some generality from the per- 
sonal to the nonpersonal realms in the cognitive 
behavior of individuals suggests the trans- 
situational pervasiveness of some sort of cogni- 
tive or perceptual “attitudes” in human 
behavior, as other workers have posited (10). 
Our results are also compatible with studies 
utilizing the Rorschach as a measure of per- 
ceptual rigidity (5, 6, 7, 8). 

The theoretical problem remains of relating 
a concept such as the complexity of the cogni- 
tive system to the behavior of the individual. 
Clinically, we are often impressed by the 
manner in which patients may generalize be- 
havior from one situation to another, perhaps 
“unrelated,” situation. Reacting towards 
others “as if” they represented some earlier 
interpersonal relationships has been stressed 
by Sullivan in his concept of the “parataxic” 
mode of behavior. What is involved here is the 
way in which a patient conceptualizes other 
persons of importance to him, the way in 
which gradients of similarity and dissimilarity 
develop in perceiving others, and how subtle 
or crassly stereotyped the patient may be in 
these perceptions. These are among some of 
the behaviors which would seem to be related 
to such a formulation as the complexity of the 
individual’s cognitive system. A beginning has 
been made in this direction by relating the 
cognitive-complexity variable to the predictive 
accuracy of the individual’s behavior (3). 

It is important that the methodological 
problems in obtaining an over-all measure of 
the degree of differentiation in one’s perceptions
of others be recognized. Although it has
been demonstrated that an adequate degree 
of stability exists over time for the measure of 
cognitive complexity in the perception of 
people, certain basic problems are posed. For 
example, the sampling of the people involved 
in measuring the complexity of perceptions of 
others is crucial. They may be peers, family 
members, old, young, same sex, or opposite 
sex. Many people may be sampled, or only a 
few. It is apparent that generalization of find- 
ings such as those obtained in the present 
study should be limited by a consideration of 
these factors. 

The use of the modified Rorschach blots in 
this study underscores a recognized but ne- 
glected area of investigation concerning 
Rorschach behavior, namely, the importance 
of variability of response both as to deter- 
minants and content. The fact that it was 
possible to demonstrate significant relation- 
ships between inkblot-response variability and 
degree of differentiation in perceiving others 
suggests the importance of such an approach 
to Rorschach performance. While perseverative 
and repetitive behavior on the Rorschach 
has received most clinical and research atten- 
tion relative to certain diagnostic patterns, 
such as organicity and mental deficiency, the
results of this study indicate that concep- 
tualizing and analyzing response variability or 
complexity as a function of the general cogni- 
tive system of the normal individual may add 
an important dimension in incorporating
Rorschach behavior into a systematic theoreti- 
cal framework. In addition, the finding that 
content complexity is at least as related to
complexity in perceiving people as is deter- 
minant complexity, is consistent with the 
increasing emphasis being placed upon content 
interpretation in Rorschach practice (13). 


SUMMARY 


The assumption is made that fundamental 
to an understanding of human behavior from 
a perceptual viewpoint is the underlying 
structure of the individual’s cognitive system 
with which he actively interprets his social 
world. The structure of this system can be 
delineated as to its relative complexity-sim- 
plicity. It was reasoned that the generality of 
cognitive complexity, operationally defined by 
response variability, could be measured between
a perceptual situation involving persons
as stimuli and a perceptual task using inkblots
as stimuli. Two experimental hypotheses were 
derived: I, Significant positive relationships 
exist between the complexity of Ss’ perceptions 
of people and the complexity of their percep- 
tions of inkblots relative to the determinants 
used and II, Significant positive relationships 
exist between the complexity of Ss’ perceptions 
of people and their perceptions of inkblots 
relative to the content elicited. A measure of 
cognitive complexity of persons was derived 
from a concept-formation sorting task
originally devised by Kelly (9). The stability 
of this measure over time was demonstrated. 
A modified Rorschach inkblot task was used 
to measure cognitive complexity in the nonpersonal
stimulus realm. The sample consisted
of 40 undergraduate college males.

Significant relationships in the predicted 
direction were found for both experimental 
hypotheses, using several indices of complexity 
of response in the Rorschach situation for 
determinants and for content. Although in- 
telligence correlated significantly with the 
inkblot measures of complexity, it appeared 
to play no role in producing the generality of 
cognitive complexity which was found. Of the 
various Rorschach determinant categories, 
only M correlated with the measure of com- 
plexity of perceptions of others at a level sig- 
nificantly different from zero. The theoretical 
and clinical significance of a variable such as 
cognitive complexity is discussed. 
THE EFFECT OF STIMULUS AND BACKGROUND FACTORS ON 
THE VOLUNTEERING RESPONSE 


MILTON E. ROSENBAUM¹

University of North Carolina


IN AN earlier experiment (2) the effect of
permitting subjects (Ss) to observe the
response of other people to a standard invitation
to volunteer for a task before the same
invitation is tendered to the Ss was studied.
It was found that Ss’ acceptance of the invitation
was significantly greater if they observed
another person accepting the invitation
than if they observed rejection of the invitation
or were not permitted to observe the response
of others in a control condition. Significantly
less acceptance occurred when rejection
by others was observed. These findings were
interpreted as indicating that behavior within
social situations is the product of a judgmental
act in which S is making use of all effective portions
of a stimulus complex to determine behavior
appropriate for him.

Deriving its orientation from Helson’s theory
of adaptation level (1), the present experiment
elaborates the former by systematically
studying the contribution of the stimulus in the
immediate focus of attention in conjunction
with presentation of background stimuli in the
form of the responses of other people.

Basic design. The general plan of the study
called for creating test situations within which
invitations to participate in a psychological experiment
that varied in strength were tendered
to Ss after they had seen the reaction of another
person to the same request, with the observed
reaction also being subjected to systematic
variation. Three stimulus requests
were used, designed to vary along an intensity
continuum in terms of their capacity to produce
a volunteering response. Positive and
negative backgrounds and a control condition
or neutral background were introduced to determine
the rate of volunteering in conjunction
with each of the stimulus requests.

Hypotheses. The variations of stimulus request
strength and the character of social background
mentioned above lead to the following
hypotheses: (a) willingness to volunteer is positively
related to stimulus-request intensity,
and (b) willingness to volunteer is directly related
to the behavior of others who are observed
responding to the same invitation.

¹ This report is based upon a portion of a dissertation
submitted in partial fulfillment of the requirements for
the degree of Doctor of Philosophy at The University
of Texas. The writer is indebted to Dr. Robert R.
Blake for his guidance as thesis advisor. Appreciation
is also extended to Dr. Harry Helson for theoretical
and interpretive assistance. Special gratitude is owed
Joseph A. Marin for his splendid cooperation in collecting
the data. Dr. John Thibaut and Dr. W. Grant
Dahlstrom offered valuable advice in the preparation
of this paper.


METHOD 


Setting and subjects. The study was conducted during 
the regular Fall semester of 1953-54 at The University 
of Texas. The settings were the reading rooms of two 
large university libraries. The Ss, numbering 135 males, 
were occupants of the library going about their every- 
day school affairs. Both libraries were divided into six 
geographical areas. On each experimental occasion one 
of these locations was selected by a counterbalanced 
plan, and those students were chosen to be Ss who were 
seated in positions in the designated area that permitted
the seating of an experimenter’s (E’s) assistant
next to them. Fifteen test Ss were employed under
each of the nine conditions to which they were randomly
assigned.

Assistants. Fifteen assistants participated and each
served on six different testing occasions, once under
each of the six experimental conditions requiring an
assistant, to which they were randomly assigned.
The assistants were all males recruited from undergraduate
and graduate psychology courses. On each
testing occasion the assistant was instructed concerning
the response he was to give. He was not informed in
advance of the intensity of the request that would be
used on any specific test occasion.

Experimenters. Two male Es conducted the study
proper. Both were graduate psychology students. Their
administration of the experimental conditions was
randomized over the nine conditions. Although one of
them was aware of both the request intensity and the
background that would be used in any given test
situation, the other E was aware of the request intensity
only. He did not know what the assistant’s response
would be until the response was actually given. A
continuing observational check was made to insure the
consistency of the behavior of the two Es.²

² It is possible that one of the Es might have induced
different rates of acceptance and rejection of the
invitation than the other. A comparison of the frequencies
of acceptance and rejection obtained by the
two yields a χ² of .552 which with one degree of freedom
indicates a p value greater than .30 and demonstrates
that no significant difference occurred in the manner
in which the two conducted their roles in the experiment.
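
The probability quoted in footnote 2 can be checked without tables. The sketch below is illustrative only: it relies on the fact that a χ² variate with one degree of freedom is the square of a standard normal deviate, so its upper-tail probability is erfc(sqrt(χ²/2)). The function name is hypothetical and is not part of the original report.

```python
import math


def chi2_p_value_df1(chi2: float) -> float:
    """Upper-tail probability of a chi-square variate with 1 df.

    For 1 df the statistic is the square of a standard normal deviate,
    so P(X > chi2) = erfc(sqrt(chi2 / 2)).
    """
    return math.erfc(math.sqrt(chi2 / 2.0))


# Chi-square reported for the comparison of the two experimenters.
p_experimenters = chi2_p_value_df1(0.552)
print(f"p = {p_experimenters:.3f}")
```

The resulting probability lies well above .30, in line with the statement that the two Es did not differ significantly in the rates of acceptance they obtained.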
Development of intensity differences in the
requests. A pilot study was conducted in order to
devise stimulus-requests that would vary along the
dimension of strength or intensity. By using the frequency
of acceptance as a measure of the strength of
the stimulus-request it was possible to determine the
frequency with which a particular form of request would
produce the act of volunteering in the context of a
neutral field. After a number of preliminary trials with
different versions of the stimulus-requests, three that
satisfied the requirement of representing discriminably
different points on the stimulus continuum were selected
for use in the experiment proper.

Each of the requests dealt with a need for participants
in a psychological experiment, but they differed
in terms of length of time implied to complete the task
and the [illegible] of the phrasing. The high
intensity request was phrased as a plea for subjects to
help complete dissertation research. The medium
intensity request was a matter-of-fact statement of a
need for departmental research. The low intensity
request was stated in a form presenting a relatively
weak [illegible] have S participate in an experiment but
also [illegible].

Creation of background differences. Background was
created by the reaction of a person invited to participate
[illegible] the naive test S. By [illegible]
the test S, E first approached
the assistant, who had been instructed as to the response
he should give. The test S was then approached and
requested to participate in the experiment. The general
procedure used to create each of the background
conditions is described below.

Positive background. To the invitation to participate
in the experiment, the assistant responded by saying
“Okay.” Instead of acting on this reaction
by leaving the library immediately, E asked the
assistant to wait, standing beside his seat. He then
invited the test S. This procedure did not allow the
test S to determine the actual amount of time that
would be required to complete the experiment.

Only one test S was approached at any given time.
If he accepted the invitation after the assistant had
[illegible], E escorted them both to an experimental
[illegible] nearby. The assistant and the
test S were administered a rating scale which allowed
them to indicate the strength of their desire to participate.
A level-of-aspiration task was
administered to the test S who upon its completion
returned to the library. The level-of-aspiration data are
not considered in the present study. If the test S
rejected the invitation, he was asked to fill out the
rating scale at the library table. The assistant and the
experimenter then left the library together.

Negative background. The assistant responded to the
invitation by saying, “No, I’d rather not,” and he
turned back to his books. The E then presented the
invitation to the test S. If he accepted, he was escorted
from the library and was given the rating scale and the
level-of-aspiration task as described under the positive
condition. If the test S rejected the invitation, he and
the assistant were given the rating scale to fill out. The
E then left the library. After a reasonable time the
assistant left the library also.


Neutral background. The S was approached directly
and tendered the invitation. He was not given the
opportunity to observe the reaction of another person
under this condition. The E acted upon S’s response
appropriately and administered the rating scale.
The experimental situations began on the hour
with selection of a position at which the assistant could
be seated. At a quarter past the hour, after interaction
with the first test S had been completed, a second
experimental situation was begun. Another location was
selected where an assistant could take a seat next to
another naive S. The manipulation of the background
conditions was conducted in a manner consistent with
the naturalness of the setting. There were no indications
that deception was suspected. The high degree of
interest of the volunteering Ss in the level-of-aspiration
task led to a belief that the manipulations were
seen as a usual process in recruiting Ss for psychological
experiments.


Measurement of response. Frequency of acceptance
or rejection of the invitation serves as one [illegible]
for [illegible] factors, but the measure of chief interest
consists of a rating scale devised for the purpose of
measuring the intensity of feeling about volunteering.
It was introduced because subjective evaluation of
responses in the earlier experiment indicated that Ss
accepted or rejected the invitation with different degrees
of willingness and unwillingness. This cannot be
adequately expressed by means of the acceptance-rejection
dichotomy.

The [illegible] thirteen categories, the
seventh [illegible] theoretically [illegible] since the
[illegible] undecided response. Any
S [illegible] only one half of the scale.
If his [illegible] he was given only the
portion [illegible] willingness to volunteer.
An S whose response was negative was administered
the portion dealing with degree of unwillingness to
volunteer.


RESULTS AND DISCUSSION 


Comparison with the earlier experiment. The 
medium intensity stimulus was almost identi- 
cal with the standard invitation used in the 
earlier experiment with only a minor change in 
wording in terms of the time implied. Presenta- 
tion of this stimulus in conjunction with the 
background conditions essentially constitutes 
a replication of the earlier experiment. It is of 
interest then to compare the results to check 
the reliability of the previous findings. Signifi- 
cant differences were found in the past among 
the background conditions when frequencies of 
acceptance and rejection were evaluated by 
means of χ². Greatest acceptance occurred under
the positive background and least under
the negative background. The data necessary 
to evaluate the similar aspects of the present
experiment are presented in Table 1.

TABLE 1
FREQUENCIES OF ACCEPTANCE AND REJECTION UNDER
THE MEDIUM INTENSITY STIMULUS-REQUEST

Background
Condition       Acceptance    Rejection
Positive            12             3
Neutral              7             8
Negative             1            14

For the differences among the three background
conditions, the χ² is 19.86 which with
two degrees of freedom is significant beyond
the .001 level of confidence. It is apparent
therefore that the replication within the present
experiment of the conditions of the earlier
experiment yields the same result indicating
that response to a medium intensity stimulus-request
is directly related to the character of
the background condition.

The relationship of stimulus and background
to volunteering. The experimental procedure
permitted an analysis of variance in a 3 X 3
factorial design. For this purpose the rating-scale
data were used as the measure of response.
Bartlett’s test yielded a χ² of 10.72, p > .05,
supporting the assumption of homogeneity
necessary for the analysis of variance.

The analysis of variance is summarized in
Table 2. The within-sum-of-squares serves as
the appropriate error term. Both the stimulus
and background variables are seen to be significant
in their effect on the response elicited
in the predicted direction: the higher the intensity
of the stimulus-request, the greater the
willingness of the test S to volunteer. On the
other hand, the more conducive the background
conditions toward the act of volunteering
in terms of the observed behavior of another
person, the greater the willingness of the
test S to volunteer. No significant interaction
is indicated between the two variables. Both
hypotheses therefore are supported by the
results.³

TABLE 2
RESULTS OF THE ANALYSIS OF VARIANCE OF THE
RATING SCALE DATA

Source          df    Mean Square      F
Background       2       57.63       10.16*
Stimulus         2      118.69       20.93*
Interaction      4        9.73        1.72
Within          [illegible]

* Significant at the .01 level

³ Further analysis in order to evaluate the differences
that appear between the levels of each variable was
conducted by means of the t test. It is found that for
the most part the differences are significant at the .01
level. The single exception occurs in the comparison of
the neutral background with the negative background
for which the p value is greater than .10.

TABLE 3
MEAN SCALE VALUES AND FREQUENCIES OF ACCEPTANCE AND REJECTION BY
STIMULUS-REQUEST INTENSITY AND BACKGROUND CONDITION

                              Stimulus-Request Intensity
                   High                    Medium                   Low
Background   Mean Scale  Frequency   Mean Scale  Frequency   Mean Scale  Frequency
Condition      Value   Accept Reject   Value   Accept Reject   Value   Accept Reject
Positive        9.40     12     3       9.07     12     3       7.53      6     9
Neutral         9.13     12     3       7.40      7     8       4.00      0    15
Negative        8.60     11     4       5.47      1    14       5.33      1    14

The frequency of volunteering apart from
degree of willingness is of interest from a practical
standpoint. For comparative purposes
Table 3 presents both the mean values obtained
from the rating-scale data and the
frequencies of acceptance and rejection of the
invitation that appeared under the various conditions.
χ² analysis of the frequency data supports
conclusions similar to those obtained
from the analysis of variance of the rating-scale
data. χ² values significant beyond the .01
level are obtained for the relationship of both
stimulus-request intensity and background
condition to volunteering. Again there is no
interaction effect. The rating-scale measure is 
more sensitive than actual volunteering, how- 
ever, to the effect of the variables. Under three 
sets of combined stimulus and background con- 
ditions in one case, and in two sets in another, 
the frequencies of acceptance of the invitations 
are identical, while the rating-scale data clearly 
differentiate the conditions. 
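
The kind of χ² analysis applied to the frequency data can be illustrated with the medium intensity request, whose acceptance and rejection counts appear in the medium-intensity columns of Table 3. The sketch below is a plain, uncorrected Pearson χ² for a 3 X 2 table; the function and variable names are illustrative. Because the report does not describe the computing formula that was used, the value obtained this way should not be expected to reproduce the reported 19.86 exactly, although it supports the same conclusion.

```python
import math

# Accept / reject counts under the medium intensity request
# (medium-intensity columns of Table 3), 15 Ss per background.
observed = {
    "positive": (12, 3),
    "neutral": (7, 8),
    "negative": (1, 14),
}


def pearson_chi_square(table):
    """Return (chi-square, df) for a dict of rows of observed counts."""
    rows = list(table.values())
    row_totals = [sum(row) for row in rows]
    col_totals = [sum(col) for col in zip(*rows)]
    grand_total = sum(row_totals)
    chi2 = 0.0
    for row, row_total in zip(rows, row_totals):
        for count, col_total in zip(row, col_totals):
            # Expected count under independence of background and response.
            expected = row_total * col_total / grand_total
            chi2 += (count - expected) ** 2 / expected
    df = (len(rows) - 1) * (len(col_totals) - 1)
    return chi2, df


chi2, df = pearson_chi_square(observed)
# For exactly 2 df the upper-tail probability has the closed form exp(-chi2 / 2).
p_value = math.exp(-chi2 / 2.0)
print(f"chi-square = {chi2:.2f}, df = {df}, p = {p_value:.5f}")
```

With two degrees of freedom the probability computed here falls below .001, so the relation between background condition and acceptance remains significant at the level stated in the text.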

Although the present results clearly indicate 
that knowledge and control of immediate stim- 
uli and background stimuli lead to predicta- 
bility of a simple social response with a high 
degree of confidence, this experiment did not 
manipulate or control a third major class of 
contributing variables: “residual stimuli” (in 
Helson’s terms) deriving from such factors as 
genetic predispositions, prior experiences, atti- 
tudes, abilities, etc. The contribution of this 
source to the unaccounted-for variance in the 
experimental results was strikingly demon- 
strated in the response of one S. This S, under 
the condition least conducive to volunteering, 
the low stimulus-request intensity and nega- 
tive background condition, accepted the invi- 
tation to volunteer, and on the rating scale 
checked Category 13, ‘‘would not miss for any- 
thing.” He explained spontaneously that he 
was a freshman planning to major in psychol- 
ogy and that this was the first opportunity he 
had been offered to be in direct contact with 
any type of psychological work. Undoubtedly 


responses given by other Ss were to some ex- 
tent conditioned by internal factors such as an 
examination the next hour, fear of psychologi- 
cal experimentation, lack of interest in their 
library studies, and many other individual 
characteristics. Major problems exist in the 
identification and control of the major sources 
of stimulation leading to the appearance of 
social responses but the present results suggest 
the utility of the threefold classification discussed
above.
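
The unaccounted-for variance mentioned above corresponds to the within (error) term of the analysis of variance in Table 2. As a rough consistency check, the sketch below backs the error mean square out of each printed F ratio, since each F is the effect mean square divided by the within mean square. It assumes the design described under Method (nine cells of 15 Ss each); the variable names are illustrative and the implied values are derived here, not quoted from the table.

```python
# Mean squares and F ratios as printed in Table 2.
table_2 = {
    "background": {"df": 2, "mean_square": 57.63, "F": 10.16},
    "stimulus": {"df": 2, "mean_square": 118.69, "F": 20.93},
    "interaction": {"df": 4, "mean_square": 9.73, "F": 1.72},
}

# Assumed design: 9 cells of 15 Ss each, as described under Method.
N_CELLS, N_PER_CELL = 9, 15
df_within = N_CELLS * (N_PER_CELL - 1)

# Each F is the effect mean square divided by the within mean square,
# so every row implies a value for the error term.
implied_error = {
    source: row["mean_square"] / row["F"] for source, row in table_2.items()
}

for source, value in implied_error.items():
    print(f"{source:<12} implied within mean square = {value:.2f}")
print(f"within df = {df_within}")
```

The three rows imply nearly the same error mean square, which suggests that the printed mean squares and F ratios are internally consistent.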


SUMMARY 


The response of Ss made up of university 
library readers to an invitation to participate 
in a psychological experiment was studied un- 
der varying conditions of stimulus-request in- 
tensity and background stimulation presented 
in the form of responses of other people in the 
same situation. The background stimulation 
was controlled by use of assistants to the ex- 
perimenter. Both stimulus and background 
factors were found to be significantly related 
to the volunteering response. 
CASE REPORTS 


RUMOR IN A PRIMITIVE SOCIETY 
RAYMOND FIRTH 


London School of Economics and Political Science 


IN a small-scale “primitive” society of the
type usually studied by anthropologists, 
what is the currency of rumor, and how 

do its processes compare with those of rumor 
in more sophisticated societies? These ques- 
tions have not yet been studied. Rumor is a 
subject of which most anthropologists in the 
field seem to have had much experience, gen- 
erally and as affecting themselves. But, curi- 
ously, they have written little about its forma- 
tion, its incidence, and its social repercussions. 
This essay is meant to open up the problem 
by analysis of data from the Polynesian island 
of Tikopia in the Western Pacific.¹ It is also
hoped that the material will be of interest to 
psychologists.

What are the main characteristics of rumor? 
They are these: talk or report of hearsay kind, 
not original expression; general currency or 
spread of such report through a social group; 
assertions of doubtful accuracy or unverified.²
For an anthropologist many questions are in- 
volved in rumor. What is its material or con- 
tent in a primitive society? What themes are 
most commonly involved? How far are they
of individual, how far of social concern? Are
these themes few and repetitive, or do they
show a wide range of variation? Is the treatment
of them relatively prosaic, corresponding
fairly closely to ordinary verified experience,
or is it experimental, with much
fantasy? Other questions arise regarding the
originator of rumor, the agents who spread it,
and the efforts made at its verification or disproof.
There is also the time factor: are
rumors ephemeral or persistent; are they slow
or do they come in rapid succession?

¹ I am indebted greatly to the Behavioral Sciences
Division of the Ford Foundation for the help of Mr. J.
Spillius, my research assistant, in contributing toward
this material, and for secretarial services in preparation
of the article. General data about Tikopia are
given in Firth (2, 3).

² I drafted this essay originally without seeing the
interesting analysis by Allport and Postman (1). But
the above criteria, which seem to represent the lowest
common denominator of the concept in popular usage,
fit fairly closely Allport and Postman’s more precise
definition: “rumor is a specific (or topical) proposition
for belief, passed along from person to person, usually
by word of mouth, without secure standards of evidence
being present” (1, p. 9). I have been encouraged
in the publication of it by discussion
with Gordon W. Allport, Jerome S. Bruner, David M.
Schneider and other colleagues at a meeting of the
Anthropology Colloquium and Social Relations Colloquium
at Harvard, where the material was first
presented.

Finally, what is the functional significance,
if any, of rumor in relation to the structure 
and organization of the society where it oc- 
curs? Its negative, disruptive functions often 
seem fairly clear; the very use of the term is 
ordinarily pejorative. But can any more posi- 
tive social functions be discerned in its opera- 
tions? Does rumor ever promote group soli- 
darity without apparent scapegoat-creation 
or other social damage? Does it provide indi- 
vidual defense mechanisms against group 
pressures? Can it be a means of group mobili- 
zation for social action? In the following brief 
account I try to provide interim answers to 
some of these questions. 

As a general proposition, I would hold that
rumor has not just purely negative social
functions, and that it is not simply the product
of idle curiosity or fantasy, but serves as a
social instrument, helping groups or individuals
to gain their ends.

Reporting, Invention, and Rumor 

When I was in Tikopia in 1928-29 and in
1952, though I did not make any detailed
study of rumor, on both occasions I noted
many of the rumors encountered and their relation
to social affairs. My assistant in 1952-53
did this also. Our records are the basis of the
following analysis.

First consider the question of accuracy.
The line between news, ideally the reporting 
of verified events, and rumor, the reporting 
of unverified events, is a very difficult one to 
draw, as every newspaper recognizes. At some 
point between eyewitness and recipient of news 
in any society there must be some degree of 
trust in the veracity and the accuracy of the 
informant. This is one of the requirements of 
social living. If there should be no trust, then
nothing but eyewitnessing of events could 
serve as a basis for action. If there should be 
trust but no veracity or accuracy, then social 
living would soon break down through lack of 
correspondence between account and subse- 
quent experience. In any society, however, 
there can be only an approximation to the 
ideal. Facts offered to the eye may be in- 
complete, the observer may see only part of the 
event which he wishes to describe. And par- 
ticularly if any attribution of intention be in- 
cluded in the report, the interest of the ob- 
server himself may lead him to misrepresent 
it. With the recipient then, there always tends 
to be some amount of reserve as to the correct- 
ness of report. All this is a statement of so- 
ciological first principles. How does it apply 
to the case of Tikopia? 

In Tikopia, a vast amount of ordinary news 
is passed on by word of mouth daily. This is 
so especially within any village, and between 
villages in the same district. Folk are con- 
tinually traveling up and down the beach and 
along the inland paths, and exchange of news 
is part of regular social intercourse, as it tends 
to be in any rural area. Between districts there 
is less contact. Formal visiting is not very 
frequent between their inhabitants. When it 
occurs, as for a dance festival or some other 
large-scale ceremony, or when a vessel arrives 
from overseas, the visitors are apt to remain 
separate from the local residents, to hang about 
together under the trees or to sit together in a 
house. But even then there is plenty of ex- 
change of information between the parties. 
And in everyday life people of different vil- 
lages throughout the island meet a great deal, 
as in the cultivations, which are dispersed, 
not concentrated in district ownership. The 
great amount of accurate reporting arising 
from these social contacts must be set as a 
background to the prevalence of rumor. 

The Tikopia have an avid interest in all 
such news. They are eager to hear about events 
in the outside world and also about events, in- 
cluding movements of people, in their own 
community. An arrival from one side of the 
island at a ceremony on the other is usually 
asked first for news. When in 1928-29 I came 
regularly over a gap in the hillcrest from my 
house in Faea to the temple of the chief of 
Kafika in Uta I used to be asked every day 


by him at once “Any news from Faea?’’—and 
I had to tell him about the fishing, any illness 
or death, what the local chief there was doing, 
and so on. In July, 1952, when the same old 
chief was spending the night at his sacred 
glade in Somosomo inland, I joined him there 
early next morning. “Is there not any news 
from the strand?” he inquired immediately. 
“No” was the reply, to his evident disappoint- 
ment. The term which I have translated as 
“news,” taranga, is also that generically for 
“speech,” illustrating the verbal nature of 
communication in Tikopia.* This interest in 
news is presumably a correlate of the small 
size and isolation of the community. The color 
and richness of Tikopia personal experience 
are derived largely from consideration of the 
minutiae of social existence in their tiny area. 
Yet this intense curiosity is matched to a 
surprising degree by an effort at detachment. 
The news is received as conveyed by indi- 
viduals, but often treated as if it stemmed from 
a generic social source. 

As may be expected from a linguistic usage 
which identifies news with speech, the Tikopia 
make no clear-cut categorical distinction be- 
tween news of verified, accurate type and 
rumor, unverified and often inaccurate. But 
indication of some differences is often given 
by the expression a taranga o faoa, meaning
“speech of the crowd” or “speech of people
in general,” or by the introductory phrase
“E ati...,” “It is held that....” The very
fact that no precise attribution of origin is 
given implies an element of vagueness and the 
possibility of inaccuracy. Such an expression 
conveys a derogatory note, a judgment or 
evaluation in advance of investigation—or at 
least a suspension of acceptance of the story 
as correct. It also dissociates the speaker from 
the statement in advance. An analogous ex- 
pression is “Faoa e muna...”: “people 
say....” These are the nearest equivalents
to our “Rumor has it....”

*Only recently and still very rarely have Tikopia
communicated by letter or other indirect means, such
as radio.

Despite the lack of a very specific Tikopia
expression for rumor as distinct from news,
there is a very great interest in the truth of
accounts received. The question is often asked,
is the story true (maori) or false (loi)? Is it
correct (tonu) or not? Such an attitude may
be also expressed when a story is passed on.
The narrator in telling it may add “We don’t
know whether it is true or false,” or “We
don’t know whether it is correct or not.”
This indicates not only suspension of judgment
on the veracity or on the accuracy of the ac- 
count; it also indicates that the narrator him- 
self is not committed, and cannot be held re- 
sponsible for the story afterwards. Moreover, 
action may be suspended until verification is 
received. An example of this occurred in Oc- 
tober, 1928. It was reported early in the morn- 
ing that Pa Maevetau, a man of rank in Rofaea, 
had died. I was living in Matautu about half 
a mile away at the time and noted that all the 
people there waited to hear if the report was 
true (maori) or not. Then confirmation came 
by ceremonial whooping and firing of guns 
from Rofaea—a token of some important 
event. Immediately there was great excite-
ment in Matautu. People went about whoop-
ing; they dropped all work; and prepared to
form mourning parties. An immediate decision
was taken to postpone a dance which had been 
arranged. 

Some further features of the Tikopia speech 
background are relevant to the discussion of 
rumor there: the people’s interest in story- 
telling; the accuracy of their memory; their 
capacity for recognized fantasy and invention.

The Tikopia are great narrators. They de- 
light not only in hearing news as items of in- 
formation; they also delight in giving and 
hearing a presentation of news in elaborate 
aesthetic form, with dramatic emphasis. They 
dwell on incidents, the narrator taking time to 
explain in particular his own emotions and 
thoughts as an event takes place. Such narra- 
tives are normally presented as a record of 
actual events. In this the reproduction of re- 
membered items may be often extremely ac- 
curate. In 1952 I heard a Tikopia describe to 
others many details of my movements in 
1928-29. A stone on which I had sat under 
a tree in a pause in a walk twenty-three years 
before was pointed out. A funeral gathering 
was told by one man of a song which I had not 
finished writing down at his dictation then 
(the remaining stanza was dictated to me on 
the spot) and so on. And in 1928 I was given 
details of the visit of H.M.S. “Mohawk” 
thirty years before, including loss of one of 
her anchors off Tikopia—a story which on
checking recently (in 1955) I found to be quite
correct. The Tikopia are fully capable of ac- 
curate accounts from memory. 

As regards fantasy, there are a number of 
Tikopia traditional stories of “folk-tale”
character, some brief, many long and elaborate. 
These are recited informally and often frag- 
mentarily, on no set occasion. One may be 
told when a group of people are resting after 
a ceremony, when members of a household are 
waiting to go to sleep at night, or when a 
grandparent is looking after a grandchild. Such 
tales, though their truth may be a matter of 
suspended judgment, or they may even be re- 
garded as fiction, are usually not regarded as 
specifically invented. Only once did I hear a 
man tell a story which, with some amusement, 
he declared to be an invention of his own. 

Where the element of personal invention 
does come out in a very marked fashion, 
though not so acknowledged by the Tikopia 
themselves, is in the dissociated state of spirit 
mediumship. In this condition a Tikopia 
medium, especially if in a light state of dis- 
sociation, may give a long narrative dealing 
with the adventures of a spirit. But this is re- 
garded by Tikopia as falling in the category 
of a true recital, granted that spirit mediums 
as individuals are recognized as being capable 
of untruth or unverified and inaccurate pro- 
nouncements. 

On the whole, it may be said that the Tikopia 
recognize no great field for free fantasy—with 
one exception, that of song. Here a person is 
allowed free rein to his inventions and the way 
in which human relations are handled in songs 
has often considerable subtlety. There is also 
a wide range of analogy and symbolism in the 
songs (2, pp. 196-197, 289-302, 520-523).
But they are apt to be stereotyped and they 
lack the soaring of the imagination which 
characterizes songs in some other primitive 
communities. In perceptiveness about human 
relations the Tikopia are highly skilled, but 
they are not very fertile in imagination in 
fields outside their immediate experience. 
Even within such experience they do not seem 
to indulge in much deliberate fiction or in 
much speculation about possible differences 
from ordinary behavior. On one such occasion 
one of my friends observed in a classical 
style, “If mosquitoes were as big as the birds 
which fly above, when they fed on men, we
should disappear.” But this is the only remark
of the kind which I recorded. I think it would 
be fair to say that free fantasy of an intellectual 
order is not common among the Tikopia. What 
is common is free fantasy of an emotional 
order, and this forms the basis of much rumor. 


General Type of Experience Used by Tikopia Rumor

An important question in considering rumor 
is what type of experience does it use? Allport 
and Postman (1) have pointed out how in the 
formation of rumor psychological processes of 
leveling of detail by omission, and conversely, 
of sharpening of detail by selective retention, 
are very relevant. Assimilation (by cultural 
and individual selection of perception, mem- 
ory, and reproduction) also converts an 
incident into narrative material for more gen- 
eral transmittal. Such processes would seem 
to be valid as a description of what happens 
in Tikopia. 

But since a great deal of such process is 
assisted or even short-circuited by common
acceptance of signs, it is pertinent to ask what 
the Tikopia do. In our Western culture some 
signs are selected from natural phenomena, 
others from cultural phenomena. Black clouds 
are a sign of rain; the whistle of a railway 
engine in open country is a sign that a train 
is approaching a level road crossing. Both 
natural and cultural signs may have social 
implications. Rain affects human crops, and 
an engine whistle affects human traffic across 
the railway lines. But with us, ordinarily a 
“natural” sign is an indicator of natural oc- 
currence, and a sociocultural sign, an indicator 
of social occurrence. So is it also with the 
Tikopia. Among natural indicators, black 
clouds likewise are held to denote rain, other 
types of clouds denote wind, and so on. As 
cultural indicators, smoke rising from trees 
gives a sign that an oven is being made ready 
for a ceremony; or gunfire announces the 
death of a man of rank. The Tikopia have great 
interest in such signs or indicators. (They have 
a term fakamailonga, as a generic term for 
indicator.) But a point of interest in Tikopia 
perception and reproduction is that some 
natural events are interpreted as signs of 
social occurrences. A certain kind of ribbed 
cloud structure denotes to the Tikopia not 
merely a kind of weather but also the ap-
proach of a vessel which is as yet invisible
below the horizon. Similarly a rainbow or the 
sound of thunder can be a vessel sign. Even 
a sneeze can be so interpreted. 

An important point about these natural 
signs or indicators is that while from our 
Western point of view a social interpretation 
of them is unrealistic or nonempirical, from 
the Tikopia point of view this has a fair de- 
gree of empirical validity. Hence stories based 
upon them are relevant to the classification of 
rumor, but marginal in that classification. A 
story of a ship coming, based on the sight of a 
rainbow, is to be treated as a positive error, 
or mistaken attribution, in which the elements 
of interest-guided perception lie farther back, 
as it were, and have become culturally pre- 
selected and standardized. Moreover, in gen- 
eral such indicators as rainbows, cloud forma- 
tions, sneezes, are taken not as a basis for the 
spread of stories but for speculation. They 
indicate probabilities, but of no high degree. 
People do not pay them much attention, but 
wait to see if the prediction is justified. Often 
the person who makes the identification him- 
self regards the sign with some skepticism, or 
refers to it in terms of possibility—“Is it
perhaps that... ?” 

It may be noted that despite the keen 
interest the Tikopia show in the arrival of 
vessels, and their tendency to seize on in- 
dicators of approach, they have not utilized 
dreams for this purpose. Yet they use dreams 
fairly freely as indicators for foretelling the 
sex of a child, or a catch of fish. This may be 
related to the fact that whereas rainbows, 
clouds, and even sneezes are patent to outside 
observation and are checkable, the occurrence 
of a dream relies solely on individual state- 
ment. Hence the signs which it gives are apt 
to be construed as of individual rather than 
of public significance—that the person con- 
cerned will have the experience predicted. 
That occasionally a dream may provide the 
basis for rumor is shown by a case where after 
a dream a man spread the story that a shark 
caught by a fishing fleet had been caught by 
his clan; it turned out that he was wrong. But 
ordinarily dreams are signs of personal, not 
public, occurrences. 

But if neither cultural signs nor dream ex- 
periences provide the stuff of rumor, it is 
different with the material provided by dis-
sociation. Statements issued by spirit mediums
are heard publicly and are regarded not as 
individual pronouncements of the human 
personality, but as information or instruction 
from the spirit world. Hence they are deemed 
worthy of serious belief. They are the basis of 
many rumors. In 1928, as the result of the 
statement by a spirit medium, it was widely 
believed in Tikopia that if I attended the 
pagan religious rites I would take away their 
sacred power of bringing rain and fertility to 
the crops. Luckily, when I first attended, I 
came in a heavy shower of rain, no other dele-
terious consequences occurred, and the rumor
soon died down. Apart from such types of
material, rumor in Tikopia is based for the
most part on misinterpretation of ordinary
physical events of a cultural order. Examples
of this are given later.

A word may be said here about the agents of 
rumor in Tikopia. Children there are widely 
blamed for creating and spreading rumors. In 
this as in many other primitive communities, 
they have a very positive role. Often going 
around in gangs, they form a ubiquitous and 
semiautonomous social element of limited ex- 
perience and without much responsibility. 
There is no doubt that they do in fact gen- 
erate, spread, and elaborate many rumors, 
particularly those concerned with approach of 
vessels and movements of people. But to some 
extent they are a conventional scapegoat, to 
whom authorship of rumor is often shifted 
when its falsity becomes manifest. Many ru- 
mors are the work of adults. Support to this, 
apart from absence of children as narrators, is 
given by the fact that such rumors are often 
clearly in an adult sphere of interest. Women 
seem to be authors of rumors as well as men. 


Content of Tikopia Rumor 

Externally, the content of rumor in Tikopia 
is concerned with two basic subjects: the move- 
ment of vessels in relation to Tikopia, espe- 
cially their arrival; and the fate of Tikopia 
abroad. Internally, it covers a wide range, from 
the timing of ceremonies and doings of men of 
rank (including anthropologists) to quarrels, 
and reasons for illness. In all, from my own 
records in 1928-29 and 1952, I have extracted 
38 specific rumors, and from the records of my 
assistant come another 36, making a total of 
over 70 rumors noted over a period of about
28 months. This is a minimal figure, since we
did not set out to record rumors on all small 
details of behavior, or other relatively ephem- 
eral matters. (Practically all these rumors were 
false.) 

The lack of data on minor rumors, relating 
to ephemeral issues, means that our material 
does not represent a complete cross section of 
Tikopia rumor. It is not completely repre-
sentative for another reason—it is biased in the
direction of rumors affecting our own move- 
ments and intentions. This is natural—for 
operational reasons it was important to know 
what was being said about us or matters in 
which we were concerned, so as to study how 
to handle the situation. Such rumors usually 
came rapidly to us—we were speedily asked, 
as a rule, if they were true. In addition, our 
doings in themselves were the object of the 
greatest curiosity and speculation to the Tiko- 
pia, since the anthropologists have been the 
only Europeans for over a century living on 
Tikopia for any long period. But I think that 
by one channel or another, most of the major 
rumors on Tikopia during our stay reached 
us—that is, most of those which not only 
tended to alter the social behavior of the re- 
cipients significantly, but also affected the or- 
ganization of social affairs on a considerable 
scale. 

Examples of minor and of major rumors will 
illustrate some of the procedures in rumor for- 
mation in Tikopia. 

Minor rumors. One evening I was told that 
my assistant was not returning to our house in 
Ravenga that night, but was going to sleep in 
the dwelling of his friend, the chief of Tafua in 
Faea, where he had been during the day. The 
message was brought to me and others in the 
house of the Taumako chief in the village 
where we lived. It came through a small girl, 
who had been listening to the gossip around 
the bearer of a note from my assistant to me; 
the messenger had come to my house, and was 
waiting for me under a tree when I got back. 
When I read the note, it turned out to be 
merely an announcement that my assistant was 
going to be back late for dinner. This is typical 
of much Tikopia rumor. The delay in the ar- 
rival of my assistant; the arrival of a mes- 
senger from Faea—these are put together with 
a false inference that the intention is stopping 
the night. The inference is not so stupid or far-
fetched as a Western observer might think. A
Tikopia on a visit to the other side of the island
would not normally send a special message
back to say that he would be late. He might
send one of his companions, a child or young
person, to return ahead of him, with instruc-
tions to say that he was following. But a spe-
cial messenger would be sent almost certainly
only to tell his family that he was staying the 
night away. So Tikopia custom gives some 
plausibility to the rumor, and the force of the 
interpretation was added to by the sight of a 
written document, held by them to be a 
weighty thing. But in addition there is the ele- 
ment of interest in novelty. There is more fun 
in speculation about someone’s staying the 
night away than in his returning home. A ru- 
mor of similar exaggerative type came to me 
when a man fell from a breadfruit tree. Some 
boys told me that evening that he had broken 
his leg above the ankle. But his nephew, who
went and saw him, told me later that he had
hurt his back and grazed his shin, no more.
The comment in my notebook at the time (No- 
vember, 1928) was that “rumor is quick to 
exaggerate in Tikopia.”

One further subject of minor rumor which 
may be mentioned is personal illness. Rumors 
in this field range widely from assignment of 
cause to attribution of credit for treatment and 
cure. One instance of this was a rumor that the 
illness of my assistant on a certain occasion 
was cured by a spirit medium—who, in fact, 
worked no cure since he came to our house for 
quite another purpose. As regards causes, ru- 
mors frequently involve flights into the realms 
of spirit action, a subject too elaborate for 
discussion here. 

Major rumors; movement of vessels. A major 
and continued subject of rumor was the move- 
ments and arrival of European vessels. The 
isolation of the island makes the coming of a 
vessel—in normal times, only once or twice a 
year—an event of great importance. For a ves- 
sel to call is a time of excitement, news, valu- 
able exchange, gifts, and perhaps arrival and 
departure of persons. It is much desired, for 
economic as well as social reasons. Hence there 
is much inducement to false identification—a
tiny cloud on the horizon looking like a smudge
of smoke, a distant unidentified object, and a
cry of “The ship! The ship!” rings out. Such
is the novelty-interest and wish-fulfillment
type of rumor. 

During the second expedition to Tikopia 
there were at least a score of rumors of vessels 
recorded. This was partly a function of the 
strain and emotional intensity of the period— 
owing to hurricane and drought a food short- 
age had developed into a famine, and prospects 
of food relief were a vital question. It was also 
related to the presence of our radio telephone 
set, with fairly frequent messages. 

Here are some typical cases. On the 5th of 
June, 1952, a rumor began to spread in our 
village about 9 a.m.: “A ship has come.” Large
numbers of people, men and girls, went off to
the other side of the island to see and to trade.
A ship was indeed expected that day with food 
supplies, so it was very likely that some person 
might deceive himself into believing that he 
had seen its smoke on the horizon. But what 
was interesting was the lack of skeptical veri- 
fication. The Tikopia have been deceived in- 
numerable times by such rumors. And each 
time afterwards they talk about “te loi o faoa,”
the falsehood of the crowd. But they seem
just as gullible next time. On this occasion we 
questioned our retainers as to the accuracy of 
the report. One of them said that a villager, 
one of our neighbors, had gone along and veri- 
fied the story; it was quite true. I asked if the 
vessel was still far off. He replied, “Oh, she is
standing inshore.” But about 10.30 a.m. the 
news came back that the story was entirely 
untrue. The disgruntled crowd streamed back. 

Four days later the vessel arrived with food 
supplies. Ten days later again, when many of 
the supplies had been consumed and there was 
some wistful hoping that further food might 
arrive, another rumor began. It was said that 
a Government vessel would come that day or 
the next; that my assistant and I had an- 
nounced this after hearing it on the radio tele- 
phone. People should sit in their houses, we 
had stated, and not go to the woods to get food;
simply sit and wait for it to be brought by the 
vessel. Popular support was given to this story, 
by the fact that for the last two days the pe- 
culiar cloud formations supposed to be indica- 
tive of approaching vessels had been observed. 
We ourselves had had our attention drawn to 
them—for instance by the old chief of Kafika. 
The basis for this rumor about the ship appar-
ently was that in a news item from the Solo-
mons on the radio the day before, it had actu-
ally been mentioned that a Government vessel 
was proceeding from Honiara, the capital of 
the Protectorate, to Gizo, a port in the West. 
This is in the opposite direction to Tikopia, 
and more than six hundred miles away, but 
someone in the crowd which always assembled 
whenever the radio was in action (5, p. 810), 
had evidently caught a part of the message and 
garbled it. A day later the rumor took a further 
form—of criticism of my assistant and me, and 
of the radio, for (it was alleged) giving wrong 
information! It was said that we ourselves had 
started the story of the coming of the Govern- 
ment vessel to Tikopia and then it turned out 
not to be true! 

Movement of persons. Another set of rumors 
was concerned with our alleged movements. 
One was that I had tried, a year or so earlier 
(the time varied in different accounts), to re- 
turn to Tikopia. (This was in accordance with 
a statement I was alleged to have made when 
I left in 1929.) But, it was said, I had been pre- 
vented by the Government or by some other 
agency after having got part of the way, and 
turned back. The bases for this rumor were 
dual. On the one hand I did, in 1929, express a 
wish to return to Tikopia, though I had said I 
was uncertain if I ever could. On the other 
hand, I had actually paid a visit in 1951 to 
New Guinea. I had no intention on that occa- 
sion of attempting to reach Tikopia. But the 
airline to the Solomons was by way of New 
Guinea. When in the end after my arrival I 
told some Tikopia about this earlier trip it 
provided the basis for the story of a thwarted 
start for the island. Again, in May, 1952, the 
story became current that, in one case I, and 
in another my research assistant, was going to 
Vanikoro, another island about 120 miles away, 
in a Government vessel to buy tobacco and 
return with it to Tikopia. The basis for this 
was an acute shortage of tobacco in the com- 
munity at this time; several people had half 
jokingly suggested to us that we should send 
out an order for more or go and get more,
without regard, of course, to the finance in-
volved. Such rumors are of the wish-fulfillment 
type. Their genesis lies basically in an attempt 
to give concrete form to a strong desire. But 
others, also related to the movements of per- 
sons, are essentially an anxiety expression. 

The vessel on which the first shipment of
relief food supplies was brought also took away
a number of Tikopia who had recruited as 
plantation laborers. About seven weeks after 
they had gone the rumor spread that disaster 
had overtaken this labor force—though no de- 
tails were given even as to whether it was on 
sea or on land. Further, it was alleged that my 
assistant and I had received this news on the 
radio-telephone and had concealed it. (Hardly 
necessary to say this was complete fiction.) As 
a variant of this was the rumor that two men 
of this force, cited by name, had died. About a 
week later the story had assumed a different 
form, that two men had died—one of those 
cited earlier, but the other of a different labor 
shipment, at Vanikoro. Again, it was said the 
report had come by radio, but that we had not 
made it known. 

On the one hand, these rumors were reflec- 
tions of the fear of Tikopia for their kinsmen 
away at work, which past experience had 
shown usually resulted in several deaths (2, 
p. 42). On the other hand, they were also ex- 
pressions of the resentment of some Tikopia at 
our restricted use of the radio-telephone as a 
news instrument. Here was a magnificent 
source of news, we were the only channel 
through which it could come, and instead of 
furnishing them with frequent information 
about their kin or any other matter in which 
they were interested, we just listened to 
weather reports and more general news which 
they did not understand, and selfishly engaged 
in business conversations of our own! Such was 
the attitude, not very clearly formulated. In- 
deed, we guessed at this view, rather than 
actually heard it expressed.* Such rumors,
fairly directly related to fears and hopes for 
kin, are a reflection in part of the Tikopia 
family structure and its system of sentiments. 

*We did make what inquiries we could about the
laborers—including report of several deaths—and
transmitted news about them, apart from keeping the
Tikopia informed of matters of public interest. In
particular also, we kept in touch with the Govern-
ment in Honiara about the approaching food famine.
This, even apart from matters concerned with the
organization of the expedition, was difficult owing to
serious weather distortions and breakdowns, which
only the devotion of my assistant to the instrument
was able to remedy. Most Tikopia seemed genuinely
appreciative of this service, and made this known
to us.

Another rumor with personal referent was
concerned with the recruitment of the laborers.
At the time when the vessel called I was asked
by a man of rank why my assistant was bar- 
ring men of the district where we lived from 
being “written down,” i.e., having their names
entered in the recruiter’s book. I asked what 
he meant. He said that my assistant was “writ- 
ing down” names of men of the other district 
to be recruited, but not those of our district. 
Later, two other men of our village said much 
the same to me. The genesis of this rumor was 
an ordinary field inquiry to ascertain the mo- 
tivations and attitudes of those men who were 
offering themselves for recruitment, or who had 
stated they wished to go. Since the vessel had 
anchored off the coast of the other district, and 
its boatswain, a most active recruiter, was a 
Tikopia man from that district, he tended in 
the first place to recruit or approach men who 
were his own neighbors or kin. But the tradi- 
tional suspicion between the people of the two 
districts coupled with the very limited knowl- 
edge of English possessed by most Tikopia, and 
their ignorance of recruiting procedures and 
how to get themselves on the list, gave rise to 
the view that favoritism was being exercised by 
my assistant. 

Rumors based on other fears may have a 
broader sociological referent. One of these in- 
volving alleged movement of both vessel and 
person was associated with the receipt of relief 
food supplies—which came mainly in the form 
of rice. A rumor in July, 1952, after the first 
relief vessel had been gone some time, was to 
the effect that one of the most respected and 
best known of the European priests of the Mel- 
anesian Mission stationed elsewhere in the 
Solomons, had objected to this rice being sent, 
saying, “Shall it be food for pagan religious 
rites?” He said (it was alleged) that if another 
food vessel was sent he would see that it “sank
at sea.” He had been on Tikopia nearly half a
century before, and was known by a number of 
Tikopia men. One of my old Tikopia friends 
who had seen this man as a boy asked me “Is 
he someone the spirits desire? He has ‘power’.”

In this rumor there emerge several themes
characteristic of Tikopia religious thinking. 
The first is the opposition between Christian 
and pagan. In this case it is projected on to the 
distant priest, in actuality a saintly man, 
wholly innocent of such thoughts as the sinking 
of relief ships. The second is the translation of 
that opposition into terms of the outstanding
economic preoccupation of the Tikopia—food,
and its use. The close association in Tikopia
thought and action between religion and food
is seen in the whole pagan cycle of the “Work
of the Gods,” as also in the Christian argument
that the pagan religion is wasteful of food by
its offerings and ceremonial accumulations and
feasts. The third theme is that of the posses-
sion of supernatural power by men of rank.
The European missionary’s suggested “sink-
ing” of the vessel alluded to was to have been
done by supernormal techniques, and not by
physical sabotage. Hence, the question of my
friend as to whether the missionary was “de-
sired” by the spirits, meant, did he have their
support?

Violence and social strife. Traditional struc- 
tural elements were reflected in another type 
of rumor concerning the treatment of thieves. 
The rumor became current, as the food situa- 
tion worsened, that if the rice shipment did not 
arrive the chiefs would have given orders that 
thieves would be killed. I told one of the chiefs 
of this. He laughed and said “Oh no!” He ex- 
plained that what had been said was that any- 
one who thieved consistently should be carried 
off to work in the Solomons. This, in fact, was 
the policy of the chiefs, and they did succeed 
in having several thieves recruited as laborers. 
But one traditional punishment in such cases 
was banishment to sea, which was virtually 
equivalent to execution (4). The rumor here 
was doing two things: it was reviving this ex- 
treme punishment in anticipation; on the other 
hand, it was assimilating it to the general type 
of physical rough handling to which thieves, in 
fact, were traditionally subjected. 

A little later we had a repercussion of this 
rumor in the form of a message from the Gov- 
ernment of the Solomon Islands Protectorate, 
stating that it had been reported on Tikopia 
authority that if it had not been for the arrival 
of relief supplies the chiefs would have taken 
the matter into their own hands and killed 
thieves. We were asked for comment. We re- 
plied that the story was without foundation. 
But almost at once two other radio messages 
came from Tikopia police constables at Dis- 
trict headquarters addressed to their kinsfolk 
on the island. Each was from a son to his 
father. One was: “I heard news stop People
there thieving stop advising you and the family
not to follow them.” The other was: “I heard
news about there stop look after people do not do
any harm to them.” These messages were in-
teresting from several points of view. They 
used the father’s personal name in address, 
thereby breaking one of the stringent Tikopia 
taboos. They were in English through an in- 
termediary. They were certainly the first time 
that Tikopia had ever used radio as a means of 
communication with one another. But the most 
significant point in this context is that both 
these young men obviously believed in the pos- 
sibility that the rumor was true. This was 
partly a reflection of their belief, later shown 
to be well founded, that violence was not far 
from the surface of Tikopia public affairs in 
such a time of severe strain and crisis. It was 
also partly an indication of the extent of their 
credence in what was, at that time, a far-
fetched suggestion.

As the famine crisis developed rumors of
violence became more frequent. They took
such forms as this: “An aristocrat was going
berserk and was going to take his club and kill 
commoners.” “Two prominent executive offi-
cials were going to kill thieves. Executive
officials were about to fight among themselves
in a struggle for power.” Each of these had
become, indeed, a none-too-remote possibility,
and each followed the traditional Tikopia cul- 
tural pattern. 

The rumor about the chiefs having thieves 
killed involves the conception of the chiefs as a 
primary source of public order in Tikopia. But
it implies more: opposition of interests, as
well as the coincidence of interests between
chiefs and their families, on the one hand, and
commoners on the other. This was exemplified 
in another rumor—that the chiefs were going
to band together and drive out the commoners 
to seize their lands. This is a theme of long 
standing in Tikopia. I was asked about it 
rather anxiously by leaders of the commoners 
in 1952, but it was then only a fear and de- 
mand for reassurance, not a concrete rumor. 
Later, when my assistant was alone in the 
island and the food situation had degenerated 
still more, it became rumor. At his instance 
there was a meeting of chiefs and executive 
officials in the Ravenga district where he lived, 
to discuss plans for the distribution of a further 
food supply being sent as relief. This meeting 
was interpreted by a spirit medium of the other 
district, Faea, as a meeting to decide on the
long-talked-of expulsion of the commoners of
Faea, to make them set off in their canoes, 
leaving their homes forever. As a result the 
men of Faea bound their heads for war, took 
down clubs, and conferred excitedly, deter- 
mined to resist. On their return, the executive 
officials of Faea were met by armed parties. 
Seeing how the situation stood, they circulated 
among the villages explaining what had really 
occurred. But to allay the fears of the com- 
moners my assistant was asked to go over to 
Faea in person, and make a normal appearance 
as a demonstration that nothing untoward was 
happening. On his way over he was asked on 
two or three occasions, half jokingly, by women 
working in the cultivations, if it was true that 
the aristocratic families were about to expel 
them from Tikopia. Here the dormant solution 
of traditional type was evoked once again in 
time of real crisis. 


Recapitulation and Functional Implications 


I have shown that with the Tikopia, the 
content of rumor is closely related to their ex- 
perience, and that while the total range is wide, 
the main themes are relatively few and repeti- 
tive. While the mode of communication itself 
is almost wholly verbal, Tikopia rumor has 
shown itself capable of seizing upon a new me- 
dium, the radio, and of incorporating it by 
fictitious citation into the rumor content. Psy- 
chologically, the processes of formation would 
appear to be similar to those current in more 
sophisticated societies. But sociologically, the 
types of experience used vary considerably 
from those used in the West, e.g., in the public
reliance placed on the pronouncements of
spirit mediums.⁵ It is difficult in Tikopia, as
elsewhere, to identify the agents of rumors. 


But occasionally an original utterer can be 
identified, usually, it would seem, someone 
with a power-axe to grind. The Tikopia prac- 
tice of tending to avoid or disclaim public re- 
sponsibility, however, reduces the possibility of 
this. As regards spread of rumor, its currency 
in time, and the credence given to it, there is 
wide variation in Tikopia. But most rumors are 
ephemeral. They tend to succeed one another 
in fairly rapid succession, especially when the 
issue is one of immediate emotional significance,
such as a famine or the fate of absent kin.
When they persist or reappear, it is because
they correspond to some deep-seated structural
cleavage. But for the most part they diffuse
quickly through the whole community.
Whether they do so depends in part on the
possibility of check (willingness to check being in
itself an index of some skepticism) and in part
on whether the rumor is of major or minor
import.

⁵ A Western analogy is the statements of soothsayers,
astrologers, etc.; but reliance upon them for
public action is marginal.

I now draw some further implications. To 
begin with, one can classify Tikopia rumors 
into two main divisions: those which are pro- 
saic, fairly simple enlargements or explanations 
of ordinary experiences, and those which are 
fantastic, of the extraordinary kind. The for- 
mer, dealing on Tikopia with the coming of 
ships, the movements of people, the death of 
workers abroad, the cure of illnesses, are set in 
the common cultural mold. They represent 
some of the fruits of speculation about the al- 
ternatives of action. Some are fairly simple 
cases of wrong inference through not having 
access to all the premises, of which one or more 
are concealed from superficial observation. An 
example is the interpretation of a written note 
as my assistant’s intention to stay away for 
the night. 

In them, the instrumental factor⁶ is at a
minimum, though it may be present to some 
degree in giving opportunity to the rumor- 
monger for ego-assertion. Check is quickly ap- 
plied and currency is short. In other rumors 
emotional elements are more marked. There is 
hope for excitement and novelty, or food sup- 
plies, in the rumored arrival of a ship; there 
is anxiety, as in the rumor of a death. Here, the 
instrumental factor is more marked and more 
complex. To some extent it may be argued that 
the rumor serves as a tension release. By antici- 
pating the event, or possible event, it tones 
down the emotional excitement and makes it 
easier to bear. To put speculation into concrete 
verbal form and give it as a statement, not 
question, may give subtle reinforcement to the 
personality by reduction of the unknown to 
event-shape, to occurrence. Such rumors can 
be scotched on check—but they tend to revive 
again. 

⁶ This is analogous to what Allport and Postman
have called “expressive significance,” though it is
more related to concrete action (1, p. 169).

There is quite a close relation between these
last kinds of rumor and Tikopia religious speculation
involving details of spirit behavior.
Rumors that laborers have died overseas, their 
deaths being reported by radio, are very simi- 
lar to stories that spirit mediums tell of the end 
of men who have gone on canoe voyages, their 
deaths being reported by spirits. Results in 
social action would be in general the same— 
preparation of the family concerned for the 
funeral ceremonies. But there is one significant 
difference, that with the radio there is oppor- 
tunity for immediate check. Application by 
messenger to our house speedily enabled the 
family concerned to discern rumor from truth, 
and to avert their mourning. In ordinary Tiko- 
pia life, when no anthropologists with radio are 
resident, there is no such chance of check, and 
the pronouncement of the spirit medium is 
normally taken for reality. 

In rumors of the fantastic, the untoward 
event, the extraordinary, other elements are 
perceptible. In a rumor of a tidal wave, of the 
suicidal voyage of a man of rank, of the in- 
tended violence of chiefs, there are the fruits 
of speculation of alternatives to action in na- 
ture or in the society, as before. But the instru- 
mental factor is much more evident as a move 
in the field of social action. It may be indirect 
or direct. A case of its indirect operation is 
that of a hurricane and tidal-wave rumor. By 
this, the rumor-monger, it is fairly clear, was 
making a bid to regain for himself and his 
family—the leading missionary element— 
some of the control of public opinion which 
they had lost. The ostensible subject of the 
rumor had little or nothing to do with the ob- 
ject desired. But to be in the position of the 
author of the warning of disaster is tantamount 
to a claim to the special knowledge, and hence
the power, that no one else possesses. A case
of rumor with a direct instrumental factor is
the story of the meeting of chiefs and executive 
officials to decide on the expulsion of com- 
moners. This was a false interpretation of cir- 
cumstances. But its effects were cathartic, the 
crystallizing of possibilities which had been 
before men’s minds for some time. It was 
power potential made concrete. Moreover, it 
could be anticipatory. Rumor may have a 
proleptic function. The effect of the rumor was 
to bring out the commoners with arms pre- 
pared to defend their lands and homes. This 
was not necessary. But if the thought had
been in the minds of the chiefs that action of
expulsion was desirable the effect of the rumor
was to trip the wire in advance, to bring the 
matter to the surface of discussion and obtain 
reassurances as to its impossibility. To change 
the metaphor—rumor may be said thus to 
draw the teeth of intention. 

In the case just cited, the instrumental fac- 
tor was hardly likely to have been consciously 
employed, after due deliberation. But at a time 
of crisis a man of rank may allow it to be 
inferred that he intends to forau, to take his 
canoe and voyage away, to die or survive as 
the fates decide. This implication is to be dis- 
tinguished from an announcement of intention, 
but grades into it. The result is a rumor which 
spreads through the community, often in exag- 
gerated form. The final outcome of this is to 
mobilize a force of public opinion against the 
rumored action. Other men of rank will see 
that the suicide voyage is not undertaken (4, 
p. 184). And on the other hand, action is usu- 
ally taken to correct the position to which the 
man of rank originally took objection. A fur- 
ther instrumental use of rumor here is as a 
test of public opinion. By making no public an- 
nouncement of intention the man has not com- 
promised himself, and can withdraw if no 
counteraction eventuates, i.e., if the public are 
not seen to be behind him. The “kite-flying”
function of rumor can occur in a primitive so- 
ciety, as well as in a sophisticated Western 
society. 

Hence rumor may be said often to have a
positive social function. Its social effects do not
necessarily involve any individual or group in
scapegoat-suffering. It becomes an organizational
mechanism or social instrument in the
hands of individuals, seeking to remedy or improve
a status position for themselves and the
groups they represent. It may tend to maintain
rather than destroy a social structure. In
this sense the proleptic function which such a
rumor may exercise acts as an element of
conservatism. But it is a dangerous instrument.
Whether its use be conscious or unconscious,
rumor in a primitive as in a civilized society is
rarely neutral. The fact that it is interest-based
almost necessarily gives it destructive poten- 
tialities. Whether these are absorbed more 
easily in a primitive than in a civilized society 
only further comparative investigation can 
show.

THE USE OF A PHYSICAL SYMPTOM AS A DEFENSE
AGAINST PSYCHOSIS
RICHARD C. COWDEN¹ AND JOHN E. BROWN
VA Hospital, Gulfport, Mississippi


HYPOCHONDRIASIS is one of the first and
most obvious symptoms of schizophrenia (1, 3).
Through his physical or somatic complaints,
the individual may
deal with many situations because they are at 
once a defense against action, a retribution for 
guilt, and a source of displacement of sensa- 
tion. For example, ill health (a physical symp- 
tom) helps maintain the patient’s position in 
his community. The ailment provides a so- 
cially acceptable excuse for not achieving fi- 
nancial gain, social independence, or marital 
and familial success. “The very doctor who
treats him will bear witness to the fact; and 
for that reason the patient will not need to feel 
any conscious self-reproaches at using such 
means to handle problems” (4, p. 44). A hypo- 
chondriacal symptom is also a means of main- 
taining contact with his environment when any 
real interpersonal relationship is a source of 
anxiety. 

Since anxiety is usually accepted as one of 
the main stress factors in producing psychotic 
symptoms, it might be expected that a psy- 
chotic’s symptoms would decrease if his anxi- 
ety could be decreased. This experiment at- 
tempted to decrease anxiety in a patient by 
providing a socially acceptable outlet for it— 
a physical symptom. Specifically, it attempted 
to relieve a schizophrenic subject of his psy- 
chotic symptoms by replacing them with a 
physical symptom. 

SUBJECT AND METHOD 

One patient was carefully selected as the 
subject for this experiment. The criteria and 
rationale for selecting the patient were as fol- 
lows: 

1. There should be a unanimous staff diag- 
nosis of schizophrenia. Since the study was 
concerned with providing remission of pri- 
marily schizophrenic symptoms, this point was 
particularly important. 

2. The patient should have had previous unsuccessful
treatment with insulin coma therapy
and electric shock therapy. This requirement
was a form of protection for the patient and
the staff. Since other treatments had failed to
aid the patient, there was justification in trying
newer, unknown methods so long as they were
theoretically reasonable.

¹ The senior author is now at the VA Hospital,
Chillicothe, Ohio.

3. The patient should have a high level of 
anxiety. This condition was desirable since 
studies have shown that conditioning occurs 
more rapidly in anxious individuals (5, 6). 

4. There should be a previous history of 
some somatic complaint which was not of a 
nature to require hospitalization and continu- 
ous medical treatment. As in hypnosis, so in 
this study, the experimenter wanted to take 
advantage of nature in using a response high in 
the available hierarchy. Thus, an already avail- 
able symptom was selected rather than an en- 
tirely new one. For, as Freud points out, “The 
production of a conversion symptom is so dif- 
ficult that an impulsion towards discharge of 
an unconscious excitation will so far as possible 
make use of any channel for discharge which
may already be in existence” (4, p. 53, italics
added). 

On the basis of these considerations, a pa- 
tient was selected for the experiment. Follow- 
ing is a summary of his hospitalization and 
history describing him before the experiment 
was undertaken: 


This 43 year old World War II veteran was ad- 
mitted to this hospital for the first time in 1953. As a 
child he was described as being slightly precocious and
as having some difficulty in getting along with people.
After completing nine grades of school, he began 
working on construction jobs at an early age. He was 
inducted into service in 1942, and while overseas as a 
radio operator on a B-29, he was shot down in 1945. 
He gave a history of having injured his back falling 
from a tree after using his parachute. He was a POW 
for almost a year. 

He had had three neuropsychiatric examinations, 
in 1947, 1950, and 1951, resulting in a diagnosis of 
anxiety reaction. Hospitalization became necessary in 
1953 when he felt that people had it in for him and he 
began carrying a gun for protection. He had prominent 
ideas of reference connected with his contracting job. 
Following a course of ECT, he improved and was 
discharged with maximum hospital benefits in 1954, 
diagnosed as schizophrenic reaction, paranoid type.
Two months later it was necessary to hospitalize him
again. Following this admission he showed inappro- 
priate affect, was quite grandiose when talking about 
himself, preoccupied with bodily states, tremulous, and 
anxious. He was given a course of deep insulin coma 
therapy, which was completed after 38 treatments 
and 31 comas. A month after the insulin treatment, 
examination showed the patient continued to be constricted
and rigid, utilizing inadequate compulsive
and reaction-formation defenses. Religious ideation
and projected guilt were still strong. Nevertheless, he
stated that he was ready to leave the hospital. Despite
this push to leave and stated optimism, however, his
affective structure, with depression and anxiety
predominating, seemed unchanged. His defenses were
poorly coordinated, and he was extremely anxious
and insecure. Projection and autistic distortions of
reality were utilized under relatively little stress. He
was extremely meticulous in his work, very careful
and perfectionistic.


Shortly after this summary was written and 
he had been given the Rorschach, Bender- 
Gestalt, H-T-P, a sentence-completion test, 
and a modified TAT, the attempt to give this 
man a physical ailment was undertaken. The 
symptom chosen was a backache. This symp- 
tom was selected because the patient’s history 
suggested that it would be more acceptable to 
him. No actual difficulty existed however. 

A “conditioning” method, extending over a 
period of three months, was followed with all 
staff members having contact with the patient 
involved in the actual production of the con- 
version. X rays were made of his back. Every 
morning, while making rounds, the ward physi- 
cian discussed the patient’s “back pain” with 
him, actually starting the experiment by call- 
ing the symptom to the patient’s attention. At 
the same time, he was seen in thirty-minute
individual psychotherapy sessions twice
weekly, during which time the discussion was
constantly centered on the selected symptom.
The physical therapist provided heat treatments
when requested. In occupational therapy
he was cautioned not to do any heavy
lifting. Social Service helped him in planning for
a job with the “ailment” kept always in mind.


RESULTS AND DISCUSSION 


At the end of this three-month period, the 
patient was retested and brought before the 
staff to be considered for discharge. He told the 
staff that he now realized his difficulty was 
caused by a severe back injury which caused 
him to get anxious and upset. He realized that
his ailment would restrict his activities some
but not enough to interfere with his leaving the 
hospital. The following is a report from a social 
worker to whom he reported a few days later 
in another community: 


He spoke of his doctor with confidence, at the same 
time expressing much hostility towards the VA for 
“having failed to recognize that his trouble had been
a fracture of his spine,” which occurred when he bailed
out of an airplane. He said for eight years he had
suffered from his back and all that VA doctors did
was to tell him it was “his nerves.” In an attempt to
get relief, veteran took sedatives (he implied these 
were narcotics), which, “together with the pain got 
him in a bad way.” Finally, during his present hos- 
pitalization, his doctor “listened to him” and gave 
him heat treatments which relieved him. He now feels 
he knows what to do should the pain recur. 


Further evidence that the patient was using 
his back pain as a defense was observed in his 
last testing. Before the physical symptom was 
developed, he stated, “I think I am better be- 
cause I have regained contact and favor with 
God.” Three months later, he answered the
question, “In what way do you think you are
better?” with “... discovery that heat treat- 
ment is wonderful for my back. I have de- 
veloped control over pain.” The Rorschach 
showed little change except for a drop in the W 
responses from 37 per cent to 10 per cent. Some 
of the other tests gave an indication of a less- 
ening of tension and anxiety. His handwriting 
was less cramped and tremulous. The Bender 
showed less erasing, heavy shading, and tremu- 
lousness. The H-T-P showed some improve- 
ment in that the drawing of a person was more 
complete. Of interest was the presence of hands 
on the second drawing whereas none had been 
drawn the first time. When asked to draw a 
picture of himself, he was unable to improve 
his previous performance (2). The sentence- 
completion test and TAT likewise showed little 
change in ideation or attitude. 

In general, the results of the tests indicate
that little real personality change occurred
as a result of this unusual treatment.
But theoretically none was expected. The 
treatment made no attempt at changing his 
ideas or attitudes or basic personality struc- 
ture; its aim was to provide a new outlet for 
the expression of his illness which would be 
socially acceptable. Through this process, a 
great deal of the anxiety appeared to be
alleviated. The final clinical evaluation made
at the hospital by the ward psychiatrist was 
as follows: 


At the time this patient left the hospital on trial 
visit he had a feeling of well-being, was somewhat 
euphoric and grandiose, with some expansive plans 
for employment after leaving the hospital. He tended 
to be ingratiating, loquacious, but always coopera- 
tive. His interpersonal relationships here had shown 
considerable improvement, and he showed no evi- 
dence of being delusional or hallucinated. There was a 
marked lessening in the manifestation of overt anx- 
iety. He frequently had complaints of pains in his 
back, and occasionally had mild upper respiratory 
infection. There was no sensorial disturbance. There 
was very little insight. 


He is now out of the hospital and at last 
report making a satisfactory adjustment. The 
last time he left a hospital he had to be re- 
turned in less than two months. He has far 
exceeded this period on this trial; at the origin 
of this report he had been in the community 
for nine months. 

These results are consistent with Freud’s 
contention that “. . . falling ill involves a
saving of physical effort; it emerges as being 
economically the most convenient solution 
when there is a mental conflict” (4, p. 43). It 
also suggests that this approach may be of
some practical and economic benefit to hospitals
in dealing with certain patients who
have failed to improve under more customary 
treatment. It in no way “cures” the psychosis 
itself but merely provides the patient with a 
more socially acceptable manner of manifest- 
ing it. Perhaps the germ of a psychosis lies 
within all individuals; the difference between 
the patient and nonpatient may not be the 
actual conflicts and stresses but the means of
handling them.

CRITIQUE AND NOTES


THREE DETERMINANTS OF THE LEVEL OF ASPIRATION 


JAY HANSCHE AND J. C. GILCHRIST


University of Wisconsin 


In a study of the effects of reference groups on
the level of aspiration, Chapman and Volk- 
mann (1) demonstrated that the level of aspira- 
tion was inversely related to the prestige value of 
the reference group. In their study, a “test of 
literary acquaintance” was given to groups of Ss 
who were informed of the mean performance of a 
reference group on the test and who were then 
asked to give their aspiration-level. Three refer- 
ence groups of different prestige value were 
utilized and all were attributed the same performance
level. As the prestige value of the
reference group increased, Ss’ aspiration-level
decreased, but Ss given the lowest prestige reference
group, “an unselected group of WPA
workers,” though they had the highest aspiration-level,
still aspired below the stated performance
level of their WPA reference group. The college
student Ss had a mean aspiration-level of 33.05
when told the WPA workers averaged 37.2.
Three variables seemed possibly to be affecting
the situation: (a) the prestige value of the reference
group, (b) the absolute value of the reference
score, and (c) the difficulty of the sample questions
used in the test directions. The present
experiment investigates the effects of these three
variables.

METHOD


A pretest was designed to quantify the dimension of 
sample question difficulty. The Ss were asked to answer 
a series of multiple-choice questions, and on a five-step 
rating scale, to state the degree to which they were 
certain of each of their answers. Acting on the hypothe- 
sis that uncertainty of answer is due to difficulty of 
question, three sets of two sample questions each of 
low, medium, and high difficulty were established on 
the basis of their uncertainty scores. The mean uncer- 
tainty scores of the sample questions were 1.5, 2.7, and 
3.8 for the low, medium, and high difficulty sets respec- 
tively. Hereafter, levels of this variable will be represented
by Q1, Q2, and Q3 in order of increasing difficulty.

The reference groups were chosen from the same, a 
scholastic, frame of reference. In increasing order of 
prestige, they were (a) high school seniors, (b) college
sophomores, and (c) first year graduate students in 
psychology. Three reference scores were chosen: low, 
54.4%; medium, 69.4%; and high, 84.4%. Following 


Chapman and Volkmann, the directions sheets read as 
follows with appropriate changes for the several con- 
ditions: 

“On the following pages is a test of general knowledge 
of psychology. It consists of 50 questions like the fol- 
lowing: 

Example 1. The most common type of color 
blindness involves: 

a. red-green 
b. blue-yellow 
c. red-yellow 
d. blue-green 
Example 2. Which practice requires an M.D.? 
a. psychiatry 
b. psychoanalysis 
c. clinical psychology 

“Each question is to be answered by placing the 
letter corresponding to the one correct answer in the 
blank before the first word of the question. If you do 
not know the correct answer, make your best possible 
guess. 

“Your score on the test will be the per cent of ques- 
tions which you answer correctly. You cannot, there- 
fore, obtain a score higher than 100%, and since there 
are three or four possible choices for each question, you 
would probably obtain a score between 25% and 33% 
by simply guessing. 

“This test has been given to a group of first year 
graduate students in psychology who made an average 
score of 84.4%. 

“Before turning the page, indicate on the line below 
the per cent correct which you expect to make on this 
test. 


__________ %”


The combinations of the variables in the 3 × 3 × 3
design yielded 27 forms of directions for the test. Ten
copies of each form were distributed randomly to 270
students in a large elementary psychology course. The
experimental measure utilized was S’s level of aspiration
from his directions sheet.
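
The construction of the 27 forms and their random distribution can be sketched in a few lines of Python. This is an illustrative sketch only, not the authors' procedure: the function names, the fixed seed, and the use of a shuffled stack of sheets are assumptions, since the report states only that ten copies of each form were distributed randomly.

```python
import itertools
import random

# Factor levels as named in the Method section.
REFERENCE_GROUPS = [
    "high school seniors",
    "college sophomores",
    "first year graduate students in psychology",
]
REFERENCE_SCORES = [54.4, 69.4, 84.4]      # per cent correct
QUESTION_DIFFICULTY = ["Q1", "Q2", "Q3"]   # low, medium, high


def build_forms():
    """Return every combination of the three factors (3 x 3 x 3 = 27 forms)."""
    return list(itertools.product(REFERENCE_GROUPS,
                                  REFERENCE_SCORES,
                                  QUESTION_DIFFICULTY))


def assign_forms(n_copies=10, seed=0):
    """Shuffle n_copies of each form; position i is the sheet for student i.

    The seed and the shuffle are assumptions made for illustration.
    """
    stack = build_forms() * n_copies
    random.Random(seed).shuffle(stack)
    return stack


forms = build_forms()
assignment = assign_forms()
print(len(forms), "forms,", len(assignment), "sheets")
```

Shuffling a stack that holds exactly ten copies of each form keeps the cell sizes equal, which the residual degrees of freedom in the analysis (27 × 9 = 243) presuppose.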





RESULTS 


Table 1 presents the mean aspiration-level of 
each of the experimental conditions. The means 
of the main effects are given in Table 2. 

A summary of the analysis of variance is presented
in Table 3. It shows the effects of reference
group and score to be highly significant, and the
effect of question difficulty to be significant not
only when isolated, but in the first order interaction
with reference group as well as the second
order interaction.

TABLE 1
MEAN ASPIRATION-LEVEL UNDER ALL CONDITIONS

Reference group        Question          Reference score
                       difficulty     54.4     69.4     84.4

High school seniors    Q1             74.2     81.2     80.4
                       Q2             76.7     75.0     81.2
                       Q3             64.4     77.2     54.5
College sophomores     Q1             57.8     70.7     80.3
                       Q2             63.7     68.1     78.5
                       Q3             57.7     1p       83.4
First year graduate    Q1             47.0     54.7     56.9
  students             Q2             57.6     53.4     65.2
                       Q3             36.0     50.9     70.4

TABLE 2
OVER-ALL MEANS

Source                  Low      Medium    High

Question difficulty     67.2     69.8      62.9
Reference score         59.4     66.9      72.3
Reference group         73.8     70.1      54.7

TABLE 3
SUMMARY OF ANALYSIS OF VARIANCE*

Source                   df      MS         F        p <

Reference Group (RG)       2     9,315.6    50.7     .001
Reference Score (RS)       2     3,745.5    20.4     .001
Questions (Q)              2       840.5     4.57    .025
Q × RG                     4       585.1     3.18    .025
Q × RS                     4       403.0     2.19    --
RG × RS                    4       1a        --      --
Q × RG × RS                8       469.4     2.55    .025
Residual (within)        243       183.8
Total                    269

* Since the variance was heterogeneous the usually acceptable
confidence level was halved. Thus no result was considered significant
unless p < .025. See Lindquist (2, p. 78-86).
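
As a check on Table 3, each F can be recomputed as the effect mean square divided by the residual (within) mean square, and the degrees of freedom can be rebuilt from the 3 × 3 × 3 layout with ten Ss per cell. The sketch below assumes this reading of the table, which the report does not spell out. The variable names are hypothetical, and the RG × RS mean square is left out because it is illegible in the source.

```python
# Mean squares as printed in Table 3. The RG x RS entry is illegible
# in the source and is therefore omitted rather than guessed.
MS_RESIDUAL = 183.8
mean_squares = {
    "RG": 9315.6,
    "RS": 3745.5,
    "Q": 840.5,
    "Q x RG": 585.1,
    "Q x RS": 403.0,
    "Q x RG x RS": 469.4,
}

# F = MS(effect) / MS(residual), assuming every effect is tested
# against the within-cell term.
f_ratios = {source: ms / MS_RESIDUAL for source, ms in mean_squares.items()}

# Degrees of freedom implied by three factors of three levels each
# and ten Ss in each of the 27 cells.
levels, per_cell = 3, 10
df_main = levels - 1                       # each main effect
df_two_way = df_main ** 2                  # each first order interaction
df_three_way = df_main ** 3                # second order interaction
df_within = levels ** 3 * (per_cell - 1)   # residual
df_total = 3 * df_main + 3 * df_two_way + df_three_way + df_within

for source, f in f_ratios.items():
    print(f"{source:12s} F = {f:.2f}")
print("df within =", df_within, "df total =", df_total)
```

Rounded to the precision of the table, the recomputed ratios agree with the printed F column, and the degrees of freedom sum to the printed total of 269.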


DISCUSSION 


In a design similar to that of Chapman and 
Volkmann, the effect of reference group was 
again found to be a significant anchoring point 
for Ss’ aspiration-level. The effects of two other 
dimensions of the design, sample question diffi- 
culty and reference score, have also been found to 
have significant anchoring influences which ex- 
plain the puzzling portion of Chapman and 
Volkmann’s results. In that phase of this study 
in which the three reference groups were coupled 
with high reference scores, Ss’ aspiration-levels 
were below that of any prediction based on the 
anchoring effect of the reference group alone. It is 
likely that Ss considered performance over this 
level to be too closely approaching perfect per- 
formance and refused to set too difficult levels of 
aspiration for themselves. As an additional effect 
of the value of various reference anchors, Ss con- 
fronted with the high performance level of a 
relatively low status reference group seemed to 
reject the reference group as an anchor and based 
their aspiration-levels almost exclusively upon 
the anchor of sample question difficulty. In 
other words, where the various anchor stimuli 
are incongruous, Ss tend to reject those which 
are less bound to the task, in this case reference 
group and reference score. This last finding raises 
a significant question: What determines the ac-
ceptance of one reference point (e.g., question
difficulty) as the criterion for
judgment while others are rejected? It is to this
issue, it would seem, that some research need be
directed.

ETHNOCENTRISM AND STIMULUS GENERALIZATION¹


FRANKLYN N. ARNHOFF 


University of Nebraska College of Medicine and the Nebraska Psychiatric Institute 


THE phenomenon of stimulus generalization,
which has been well documented and experi- 
mentally demonstrated (2, 3, 4), has recently come 
under increased scrutiny with studies relating it to 
clinical anxiety (5, 13) and various personality 
correlates (6) including schizophrenic and brain- 
damaged patient performance (9). Ethnocentrism 
has recently been explained as an “overgeneraliza- 
tion phenomenon” (10), with at least two studies 
appearing in the last few years to give some support 
to this hypothesis (7, 11). The interpretation has 
not, however, been demonstrated quantitatively. 
This exploratory study attempts to show that 
ethnocentrism is related to overgeneralization in 
thinking, which transcends the immediate attitude 
of prejudice and has broader implications for 
theory and research. It was hypothesized that 
degree of generalization (number of errors on a 
stimulus-generalization task) is positively related 
to the subject’s scores on a scale of ethnocentrism. 


METHOD 


The subjects (Ss) were 25 female student nurses who 
were taking psychiatric training at the Nebraska 
Psychiatric Institute and who volunteered to serve. 
Ages ranged from 19 to 22, except for one S who was 27. 
The majority had spent most of their lives in the more 
rural areas of the state and had educational back- 
grounds ranging from high school graduation through 
two years of college. 

The apparatus used for the stimulus-generalization 
task has been more fully described elsewhere by Brown 
(4) whose method and procedure were also followed. 
The apparatus consisted of a six-foot, curved plywood 
panel with seven 115-v., 7.5-w. frosted lamps mounted 
horizontally. The lamps were uniformly spaced and 
were all equidistant from the S who was seated five 
feet away. A red jewelled pilot lamp, located two 
inches above the center lamp, served both as a fixation 
point and ready signal as it was turned on, in random 
order, 3-5 seconds prior to the lighting of the stimulus 
light. The S sat with the index finger of the preferred 
hand holding down a reaction key, and the latency of 
response was measured to the nearest one-hundredth 
second by means of a Standard Electric Timer. The 
experimenter (E) sat behind the panel and by means of
switches turned on any of the seven lights following the
lighting of the ready signal.

After Ss were comfortably seated at the proper 
distance, instructions were read stating that this was 
an experiment on reaction time and that they were to 


1 The author wishes to thank Dr. Judson S. Brown 
for his assistance with the details of the apparatus and 
procedure, and Miss Norma Johnson, nursing instructor, 
for her help in contacting and arranging for subjects. 
react as quickly as possible to the lighting of the central 
lamp by lifting their fingers from the key. They were 
informed that other lamps would occasionally be 
lighted to which they were not to respond, and that they 
should not be upset by errors but should keep responding 
as quickly as possible. 

Following the reading of the instructions, Ss were 
given 25 training trials to Lamp 4 (center lamp). After 
this, without interruption, 84 trials to Lamp 4 were 
given with 24 trials to test Lamps 1-3 and 5-7 inter- 
spersed among them. Each of the three lights to the 
left and right of the center lamp appeared four times. 
Six different orders of test trials were used, each begin- 
ning with a different test lamp. Four Ss were tested 
under each of the six orders. As the number of errors 
(responses to lamps other than Lamp 4) was the 
primary consideration rather than the shape of the 
generalization curve, the 25th member of the class 
was also used as an S under one of the orders. 
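
The trial structure described above (84 trials to Lamp 4 with 24 test trials interspersed, each of the six test lamps shown four times, each order beginning with a different test lamp) can be sketched in Python as follows. This is a minimal illustration only: the report does not give the actual six orders, so the random placement of test trials, the seed, and the function names `build_schedule` and `count_errors` are assumptions made for the example.

```python
import random
from collections import Counter

CENTER_LAMP = 4
TEST_LAMPS = [1, 2, 3, 5, 6, 7]
TOTAL_TRIALS = 84 + 24  # center-lamp trials plus interspersed test trials


def build_schedule(first_test_lamp, seed=0):
    """Return a list of 108 lamp numbers for one (hypothetical) test order."""
    if first_test_lamp not in TEST_LAMPS:
        raise ValueError("first_test_lamp must be one of the six test lamps")
    rng = random.Random(seed)
    # Four presentations of each test lamp; one copy of the chosen lamp leads.
    tests = [lamp for lamp in TEST_LAMPS for _ in range(4)]
    tests.remove(first_test_lamp)
    rng.shuffle(tests)
    tests.insert(0, first_test_lamp)
    # Assumed: the 24 test positions are drawn at random among the 108 trials.
    positions = sorted(rng.sample(range(TOTAL_TRIALS), len(tests)))
    schedule = [CENTER_LAMP] * TOTAL_TRIALS
    for pos, lamp in zip(positions, tests):
        schedule[pos] = lamp
    return schedule


def count_errors(schedule, responded):
    """Count responses made to any lamp other than the center lamp."""
    return sum(1 for lamp, r in zip(schedule, responded)
               if r and lamp != CENTER_LAMP)
```

The error count returned by `count_errors` corresponds to the generalization score defined in the text, that is, the number of responses to lamps other than Lamp 4.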

Each S was identified by a number rather than by 
name in order to preserve anonymity. After all Ss had 
performed the stimulus-generalization task, they were 
seen as a group and given the California Ethnocentrism 
Scale (1, p. 129), identifying their papers by means of 
the previously assigned numbers. Upon administration 
of the E scale it was discovered that one question 
regarding “zoot suiters” had no meaning to the majority 
of the group as they did not know to what it referred. 
Consequently, this item was dropped from the scale 
because of its obvious invalidity. 


RESULTS AND DISCUSSION 


In scoring the E scale, 40 points were added to 
each score to eliminate negative numbers, facilitat- 
ing treatment of the data. Generalization is here 
defined as the number of errors made by responding 
to lamps other than the one to which Ss had been 
trained to respond. The total number of errors 
made by each S on the stimulus-generalization 
task (mean, 6.16; SD, 3.21) was then correlated 
with the individual scores on the E scale (mean, 
36.44; SD, 9.17). The obtained value of r was 0.44, 
significant beyond the .05 level for 23 df (at p = .05, 
r = .396). Analysis of reaction times showed no 
trends or significant differences between Ss. 
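
The significance check above can be reproduced with a short Python sketch. It assumes a two-tailed test and takes the critical t for 23 df at the .05 level from a standard table (2.069); the function names are illustrative and are not part of the original report. The sketch also shows that adding a constant such as 40 to every E-scale score leaves r unchanged.

```python
import math


def pearson_r(x, y):
    """Product-moment correlation between two equal-length score lists."""
    n = len(x)
    if n != len(y) or n < 3:
        raise ValueError("need two lists of equal length with at least 3 scores")
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def t_from_r(r, df):
    """t statistic for testing r against zero with df = N - 2."""
    return r * math.sqrt(df / (1.0 - r * r))


def critical_r(t_crit, df):
    """Smallest |r| reaching significance, given the critical t for df."""
    return t_crit / math.sqrt(t_crit ** 2 + df)


# Assumed table value: two-tailed .05 critical t for 23 df.
T_CRIT_05_DF23 = 2.069

r_needed = critical_r(T_CRIT_05_DF23, 23)   # about .396
t_obtained = t_from_r(0.44, 23)             # about 2.35
```

Since the obtained t of about 2.35 exceeds 2.069, the reported r of .44 falls beyond the .396 required at the .05 level.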

The results of this study further suggest that 
the ethnocentric individual makes his judgments 
and reacts on the basis of overgeneralized thinking. 
A limiting factor to this interpretation, however, is 
the well-demonstrated inverse relationship between 
ethnocentrism and intelligence (1). As it was not 
possible to obtain intellectual measures on the Ss 
used in this study, this leaves open the alternative 
explanation that the obtained relationship between 
stimulus generalization and ethnocentrism may be 
due in whole or part to uncontrolled intellectual 
variation between Ss, despite the otherwise homogeneous 
character of the group. The relationship between 
intelligence and stimulus generalization per se remains 
undemonstrated, with the distinct possibility 
to consider, however, that these two variables may 
also show an inverse relationship. It is evident, 
however, that the attitude of the prejudiced individual 
does have wider implications and correlates, 
as his performance on various cognitive tasks has 
been shown to differ from that of the nonprejudiced 
(7, 11). 

In discussing the assumptions underlying the 
phenomenon of empirical stimulus generalization, 
Brown (4) criticized the explanation offered by 
Lashley and Wade (8) and Razran (12) who stated 
that generalization is attributable to a failure to 
note distinguishing characteristics of the stimulus. 
In pointing out the circularity of such an explanation, 
Brown states, “It is evident that if one is to 
use failure to note to explain empirical generalization, 
the failure to note must be defined 
independently of the particular generalization reaction 
it is supposed to explain” (4, p. 58). In 
ascribing attributes and characteristics to an 
individual on the basis of his group affiliation, as 
does the prejudiced, he is certainly failing to note 
distinguishing characteristics. Fully realizing the 
limitations of the present study and the need for 
more extensive investigation, it is suggested that 
the relationship expressed here between stimulus 
generalization and ethnocentrism may supply a 
necessary independent definition and give strength 
to the more parsimonious explanation of empirical 
generalization as being due to a failure to note 
distinguishing characteristics of the stimulus. 
More conclusive theorizing along these lines, however, 
should be delayed until some attempt has 
been made to investigate the relationship between 
the various measures of generalization that have 
been employed in this and previously reported 
studies. Research along these lines is currently in 
progress by the author and colleagues. 


SUMMARY 


This study is an attempt to demonstrate that 
ethnocentrism is related to an “overgeneralization 
process” as has been postulated by various authors. 
The performance of a group of student nurses on 
a stimulus-generalization task was correlated with 
their scores on the California E scale yielding a 
significant r = .44. Some possible theoretical implications 
are discussed and mention is made of 
further research which is in progress. 
A VALIDITY CHECK ON MMPI SCALES OF RESPONSIBILITY 
AND DOMINANCE¹ 


DONALD W. OLMSTED AND ELIO D. MONACHESI 


University of Minnesota 


Gough, McClosky, and Meehl have presented 
personality scales for Social Responsibility 
and Dominance which are partly made up of items 
from the MMPI (1, 2). The scales were derived 
from a study of high-school and college students 
rated on responsibility and dominance by peers 
and teachers who were aided in making their 
ratings by being presented with a descriptive 
definition of what the investigators meant by the 
term. 

The 56-item Responsibility scale includes 32 
MMPI items, which as a separate scale correlated 
.84 and .88 with the total scale on two samples 
reported (2, p. 78). The 60-item Dominance scale 
includes 28 MMPI items. The correlation between 
the MMPI Dominance items and the total scale 
is not reported, but the correlations between the 
MMPI Dominance items and the criterion rat- 
ings (.524 for a college sample, and .597 for a 
high-school sample) are almost as high as the 
correlations between the total scale and the criterion 
ratings (.599 for the college sample, and 
.687 for the high-school sample) (1, pp. 364-365). 


In the case of both scales, the conclusion was 
drawn that they appeared to have adequate reliability 
and validity “for the specification of 
group trends” and differences in most situations 
(1, p. 366, 2, p. 79). Both scales are mentioned 
in the 1951 revision of the MMPI Manual (3, pp. 
21-22). 


PROCEDURES AND RESULTS 


The present effort attempts to present evidence 
concerning the validity and limits of applicability 
of the MMPI portion of the Responsibility scale 
in terms of an external criterion and for a quite 
different sample of subjects. The MMPI portion 
of the Dominance scale is “‘tested’’ less directly. 
The subjects are 262 Minneapolis firemen and 
25 fire captains, representing about 93 per cent 
of the firemen on duty on several testing days 
(July, 1953), or about 54 per cent of the men in 
the fire department. The criterion used is the 
attainment of a captaincy, on the basis of the 
assumption that the captains, on the average, 
possess “responsibility” to a greater extent than 
the firemen. With less confidence, the same 
criterion is adopted concerning “dominance.” If 
these criteria are accepted, the captains should 
score significantly higher than the firemen on valid 
responsibility and dominance scales. Data for the 
subjects are shown in Table 1. Although the 
captains’ mean raw score on the 32 MMPI 
Responsibility items was slightly higher than the 
firemen mean, the difference is not statistically 
significant, and it certainly is not of a predictively 
useful magnitude. Since the distribution of the 
scores was somewhat skewed, they were 
normalized (and a constant added for computational 
convenience), with a similar result. As Table 1 
also shows, the captains had a mean raw score on 
the 28 MMPI Dominance items slightly lower 
(nonsignificantly) than had the firemen. 

1 This paper reports a portion of the research 
conducted pursuant to Contract No. N8onr66216 
between the Office of Naval Research and the 
University of Minnesota. The statements expressed are those 
of the authors, and do not necessarily represent the 
views of the Office of Naval Research. 

The articles presenting these scales give data 
concerning the means and standard deviations of 
the raw scores of the college and high school 
students on the MMPI portions of the scales. 
These are compared with the firemen data in 
Table 2. The firemen means (including captains) 
are somewhat higher on both measures. With 


TABLE 1 
COMPARISON OF MEAN SCORES OF 262 FIREMEN AND 25 
FIRE CAPTAINS ON MMPI ITEMS OF SOCIAL 
RESPONSIBILITY AND DOMINANCE SCALES 

                     Captains         Firemen      Difference 
Scale               Mean    SD      Mean    SD    Between Means    SEd       t     p (One-tail) 
Respon. (raw)      20.16   4.62    19.19   4.39        .97        .9235    1.049       .15 
Respon. (normal)   32.20  10.80    30.08   9.82       2.12       2.0734    1.024       .15 
Domin. (raw)       15.36   3.86    15.79   3.31       -.43        .7039    -.611 


TABLE 2 
COMPARISON OF MEAN SCORES OF 287 FIREMEN AND 
676 STUDENTS ON MMPI ITEMS OF SOCIAL 
RESPONSIBILITY AND DOMINANCE SCALES 

                     Firemen         Students*     Difference 
Scale               Mean    SD      Mean    SD    Between Means    SEd       t     p (Two-tail) 
Respon. (raw)      19.28   4.41    18.53   4.48        .75        .314     2.37       <.05 
Domin. (raw)       15.76   3.36    14.64   3.70       1.12        .254     4.40       <.01 

* Student data from (1, p. 365, and 2, p. 79). 











respect to Responsibility, the difference between 
means of .75 units is small enough, and the stand- 
ard deviations similar enough, to indicate that, 
whatever this set of items measures, the “tenta- 
tive norms” established in the original paper may 
be applicable to adults. 
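
The standard errors and t values in Tables 1 and 2 can be approximated from the tabled means, SDs, and Ns alone. The sketch below assumes a pooled-variance Student t test; the report does not say which formula was used, so this is an inference from how closely the pooled formula matches the tabled SEd values, and the function name `pooled_t` is illustrative.

```python
import math


def pooled_t(mean1, sd1, n1, mean2, sd2, n2):
    """Pooled-variance t test computed from summary statistics.

    Returns (difference between means, standard error, t, df).
    """
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    se = math.sqrt(pooled_var * (1.0 / n1 + 1.0 / n2))
    diff = mean1 - mean2
    return diff, se, diff / se, df


# Table 1, Responsibility (raw): 25 captains vs. 262 firemen.
t1 = pooled_t(20.16, 4.62, 25, 19.19, 4.39, 262)

# Table 2, Responsibility (raw): 287 firemen vs. 676 students.
t2 = pooled_t(19.28, 4.41, 287, 18.53, 4.48, 676)
```

Under this assumption the standard errors come out near the tabled .9235 and .314; small disagreements in the last digit of t are to be expected because the inputs are rounded.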

In the research of which this is a portion, it 
was desired to determine whether the MMPI 
Responsibility and Dominance score means for 
the fireman work-groups could be regarded as 
meaningful measures of group characteristics. Accordingly, 
the means on these scales of the 40 
fire-station work shifts were subjected to analysis 
of variance tests. The F ratio was not significant 
for either scale, indicating that the group means 
on these measures differed only randomly. Alter- 
native ways of stating the conclusions would be 
that the measures do not discriminate among the 
groups, or that, assuming the scores reflect an 
aspect of personality, this aspect of personality is 
not differentially distributed in the groups. 
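
For readers unfamiliar with the test, the F ratio referred to above compares variation among group means with variation within groups. The following Python sketch computes a one-way analysis of variance from raw scores; the three-group toy data are hypothetical and stand in for the 40 work shifts, whose scores are not reported.

```python
def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of score lists."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    if k < 2 or n_total <= k:
        raise ValueError("need at least two groups and more scores than groups")
    grand_mean = sum(sum(g) for g in groups) / n_total
    means = [sum(g) / len(g) for g in groups]
    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (m - grand_mean) ** 2
                     for g, m in zip(groups, means))
    # Variation of individual scores around their own group mean.
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    df_between, df_within = k - 1, n_total - k
    f_ratio = (ss_between / df_between) / (ss_within / df_within)
    return f_ratio, df_between, df_within


# Hypothetical toy data: three small groups of scores.
toy_groups = [[1, 2, 3], [2, 3, 4], [3, 4, 5]]
f_ratio, df_b, df_w = one_way_anova_f(toy_groups)
```

An F ratio close to 1 indicates that the group means differ no more than would be expected from the variation of individuals within groups, which is the sense in which the work-shift means "differed only randomly."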


CONCLUSIONS 


1. The mean values on the MMPI Responsibility 
and Dominance items obtained by the present 
sample of adults were slightly higher than those 
obtained by the original sample of high-school 
and college students. The differences, while 
statistically significant, are not marked, and the 
tentative norms presented with the scales may 
be roughly appropriate for adults. 

2. The MMPI portion of each scale failed to 
yield mean values for fireman work-groups (station 
shifts) which were indicative of differences 
among the groups in whatever was being measured. 

3. Twenty-five Minneapolis fire captains did 
not differ significantly from 262 Minneapolis fire- 
men in mean scores obtained on the MMPI items 
of the Social Responsibility and Dominance scales. 
If the attainment of a captaincy is granted as an 
appropriate criterion, the data appear to consti- 
tute evidence against the validity of at least the 
MMPI portion of the Responsibility scale. The 
criterion may be less appropriate with respect to 
Dominance. Inasmuch as the MMPI Responsi- 
bility items were reported in the original study 
as highly correlated with the total Responsibility 
scale, some question about the validity of the 
latter may also be raised. 


REFERENCES 


1. Gough, H. G., McClosky, H., & Meehl, P. E. A 
personality scale for dominance. J. abnorm. soc. 
Psychol., 1951, 46, 360-366. 

2. Gough, H. G., McClosky, H., & Meehl, P. E. A 
personality scale for social responsibility. J. 
abnorm. soc. Psychol., 1952, 47, 73-80. 

3. Hathaway, S. R., & McKinley, J. C. Manual, 
Minnesota Multiphasic Personality Inventory. 
(Rev. Ed.) New York: Psychological Corp., 
1951. 


Received August 8, 1955. 


REPLY TO SCOTT 


LEWIS BERNSTEIN¹ 


Veterans Administration Hospital, Denver, Colorado 


Scott (4), in a recent article in this Journal, has 
reported that he was unable to confirm some 
findings concerning the effects of handling albino 
rats previously reported by this investigator and 
co-workers (1, 3). His failure to obtain comparable 
findings may be a function of differences in de- 
sign and procedure between his and our experi- 
ments. 

Scott gentled his animals for three weeks after 
weaning, in contrast to our reported five weeks of 
petting (3). After his three-week period he ob- 
tained differences in weight gains in favor of his 
gentled animals. Although these differences were 
not statistically significant, the trend was in the 
predicted direction. If Scott had continued the 
gentling for an additional two weeks he would have 
replicated at least part of our experiment. This 
difference appears important because our data 
show that the greatest weight gains in our animals 
occurred during the fourth and fifth weeks of the 
experimental period. 

1 The writer wishes to express his thanks to Drs. 
John D. Benjamin, Karl F. Muenzinger, and Howard L. 
Siple for their comments in preparing this communication. 

Scott refers to a personal communication from 
the author (2) as indicating that there were no 
differences in water intake between our groups of 
animals. No mention of water intake was made in 
that communication. The only paper in which we 
mention water intake (3) was not referred to by 
Scott; and in that paper we report that in one 
of the three experiments the unhandled animals 
drank significantly more water than the extra- 
handled animals. 

We have reported as one of our major findings 
that a cessation of handling seriously disrupts the 
previously handled animals’ behavior. For example, 
“handled animals with whom the relationship 
was interrupted made significantly more errors 
than animals with whom the relationship was 
continued during extinction trials. In fact, the animals 
with whom the relationship had been interrupted 
made more errors than the unhandled animals. In 
other words, an interrupted relationship produced 
more errors than a minimal relationship” (5, p. 
39). Similarly, in an as yet unpublished study, the 
animals with whom the relationship was interrupted 
showed significantly higher death rates 
following intraperitoneal injection of thiourea than 
did animals with whom the relationship was 
continued. With these findings, we would predict that 
the discontinuance of the handling would disrupt 
the behavior of previously handled animals on all 
of Scott’s tests; and all of Scott’s tests following 
his three weeks of handling were precisely with 
animals who were closely analogous to our 
interrupted relationship group. Scott’s gentling stopped 
when his animals were 44 days old. His test 
situations were not begun until the animals were 119 
days old, and the thiourea injections were not 
given until they were 136 days old. In other words, 
Scott was primarily concerned with mature 
animals, while our effects appeared during, and 
immediately after, handling of immature rats. On 
this basis, one can question the validity of Scott’s 
statement that his findings in regard to the 
fearfulness of his animals failed to confirm those reported 
by us. We reported greater activity on the part of 
our handled animals in maze behavior and home-cage 
activity during the time they were being 
handled (1, 3). Scott, however, tested fearfulness 
75 days after cessation of handling, by placing his 
animals in a Miller-Mowrer box for two 3-minute 
periods. 

In the personal communication to Scott, it was 
stated that the animals were injected “... intraperitoneally 
with thiourea in dosages of 50 mg. per 
kilo body weight” (2). It is difficult to understand 
how injections in the thighs of 100 mg. per 100 gm. 
body weight (20 times our dosage) can be considered 
comparable to the injections used in our 
study. 
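
The "20 times" figure follows directly from putting both dosages on the same per-kilogram basis, as the short Python check below illustrates. The helper name `mg_per_kg` is introduced here only for the example.

```python
def mg_per_kg(dose_mg, per_body_weight_g):
    """Convert a dose stated per some body weight in grams to mg per kg."""
    return dose_mg * 1000.0 / per_body_weight_g


our_dose = mg_per_kg(50, 1000)     # 50 mg. per kilo body weight
scott_dose = mg_per_kg(100, 100)   # 100 mg. per 100 gm. body weight
ratio = scott_dose / our_dose
```

Expressed this way, 100 mg. per 100 gm. is 1,000 mg. per kilo, which is 20 times the 50 mg. per kilo used in our study.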

In summary, Scott reports a failure to confirm 
our findings on the basis of a study which was not 
a replication of ours. To the extent that his animals 
were closely analogous to one of our five groups 
(the interrupted relationship group), his findings 
tend to confirm one of our major findings, viz., 
that an interrupted relationship seriously disrupts 
the behavior of previously handled animals. 